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Introduction to First Edition 



The opening session of the physics degree course at Imperial College includes an 
introduction to vibrations and waves where the stress is laid on the underlying unity of 
concepts which are studied separately and in more detail at later stages. The origin of this 
short textbook lies in that lecture course which the author has given for a number of years. 
Sections on Fourier transforms and non-linear oscillations have been added to extend the 
range of interest and application. 

At the beginning no more than school-leaving mathematics is assumed and more 
advanced techniques are outlined as they arise. This involves explaining the use of 
exponential series, the notation of complex numbers and partial differentiation and putting 
trial solutions into differential equations. Only plane waves are considered and, with two 
exceptions, Cartesian coordinates are used throughout. Vector methods are avoided except 
for the scalar product and, on one occasion, the vector product. 

Opinion canvassed amongst many undergraduates has argued for a ‘working’ as much as 
for a ‘reading’ book; the result is a concise text amplified by many problems over a wide 
range of content and sophistication. Hints for solution are freely given on the principle that 
an undergraduates gains more from being guided to a result of physical significance than 
from carrying out a limited arithmetical exercise. 

The main theme of the book is that a medium through which energy is transmitted via 
wave propagation behaves essentially as a continuum of coupled oscillators. A simple 
oscillator is characterized by three parameters, two of which are capable of storing and 
exchanging energy, whilst the third is energy dissipating. This is equally true of any medium. 

The product of the energy storing parameters determines the velocity of wave 
propagation through the medium and, in the absence of the third parameter, their ratio 
governs the impedance which the medium presents to the waves. The energy dissipating 
parameter introduces a loss term into the impedance; energy is absorbed from the wave 
system and it attenuates. 

This viewpoint allows a discussion of simple harmonic, damped, forced and coupled 
oscillators which leads naturally to the behaviour of transverse waves on a string, 
longitudinal waves in a gas and a solid, voltage and current waves on a transmission line 
and electromagnetic waves in a dielectric and a conductor. All are amenable to this 
common treatment, and it is the wide validity of relatively few physical principles which 
this book seeks to demonstrate. 
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H. J. PAIN 

May 1968 




Introduction to Second Edition 



The main theme of the book remains unchanged but an extra chapter on Wave Mechanics 
illustrates the application of classical principles to modern physics. 

Any revision has been towards a simpler approach especially in the early chapters and 
additional problems. Reference to a problem in the course of a chapter indicates its 
relevance to the preceding text. Each chapter ends with a summary of its important results. 

Constructive criticism of the first edition has come from many quarters, not least from 
successive generations of physics and engineering students who have used the book; a 
second edition which incorporates so much of this advice is the best acknowledgement of 
its value. 



H. J. PAIN 

June 1976 
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Introduction to Third Edition 



Since this book was first published the physics of optical systems has been a major area of 
growth and this development is reflected in the present edition. Chapter 10 has been 
rewritten to form the basis of an introductory course in optics and there are further 
applications in Chapters 7 and 8. 

The level of this book remains unchanged. 



H. J. PAIN 

January 1983 




Introduction to Fourth Edition 



Interest in non-linear dynamics has grown in recent years through the application of chaos 
theory to problems in engineering, economics, physiology, ecology, meteorology and 
astronomy as well as in physics, biology and fluid dynamics. The chapter on non-linear 
oscillations has been revised to include topics from several of these disciplines at a level 
appropriate to this book. This has required an introduction to the concept of phase space 
which combines with that of normal modes from earlier chapters to explain how energy is 
distributed in statistical physics. The book ends with an appendix on this subject. 

H. J. PAIN 

September 1992 
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Introduction to Fifth Edition 



In this edition, three of the longer chapters of earlier versions have been split in two: 
Simple Harmonic Motion is now the first chapter and Damped Simple Harmonic Motion 
the second. Chapter 10 on waves in optical systems now becomes Chapters 11 and 12, 
Waves in Optical Systems, and Interference and Diffraction respectively through a 
reordering of topics. A final chapter on non-linear waves, shocks and solitons now follows 
that on non-linear oscillations and chaos. 

New material includes matrix applications to coupled oscillations, optical systems and 
multilayer dielectric films. There are now sections on e.m. waves in the ionosphere and 
other plasmas, on the laser cavity and on optical wave guides. An extended treatment of 
solitons includes their role in optical transmission lines, in collisionless shocks in space, in 
non-periodic lattices and their connection with Schrodinger’s equation. 



H. J. PAIN 

March 1998 
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Introduction to Sixth Edition 



This edition includes new material on electron waves in solids using the Kronig - Penney 
model to show how their allowed energies are limited to Brillouin zones. The role of 
phonons is also discussed. Convolutions are introduced and applied to optical problems via 
the Array Theorem in Young's experiment and the Optical Transfer Function. In the last 
two chapters the sections on Chaos and Solutions have been reduced but their essential 
contents remain. 
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on the Optical Transfer Function. I would like to thank my wife for typing the manuscript 
of every edition except the first. 
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1 

Simple Harmonic Motion 



At first sight the eight physical systems in Figure 1 . 1 appear to have little in common. 

1.1(a) is a simple pendulum, a mass m swinging at the end of a light rigid rod of length /. 
1.1(b) is a flat disc supported by a rigid wire through its centre and oscillating through 
small angles in the plane of its circumference. 

1.1(c) is a mass fixed to a wall via a spring of stiffness s sliding to and fro in the x 
direction on a frictionless plane. 

1.1(d) is a mass m at the centre of a light string of length 21 fixed at both ends under a 
constant tension T. The mass vibrates in the plane of the paper. 

1.1(e) is a frictionless U-tube of constant cross-sectional area containing a length / of 
liquid, density p, oscillating about its equilibrium position of equal levels in each 
limb. 

1.1(f) is an open flask of volume V and a neck of length / and constant cross-sectional 
area A in which the air of density p vibrates as sound passes across the neck. 
1.1(g) is a hydrometer, a body of mass m floating in a liquid of density p with a neck of 
constant cross-sectional area cutting the liquid surface. When depressed slightly 
from its equilibrium position it performs small vertical oscillations. 

1.1(h) is an electrical circuit, an inductance L connected across a capacitance C carrying 
a charge q. 

All of these systems are simple harmonic oscillators which, when slightly disturbed from 
their equilibrium or rest postion, will oscillate with simple harmonic motion. This is the 
most fundamental vibration of a single particle or one-dimensional system. A small 
displacement x from its equilibrium position sets up a restoring force which is proportional 
to x acting in a direction towards the equilibrium position. 

Thus, this restoring force F may be written 

F = -sx 

where s, the constant of proportionality, is called the stiffness and the negative sign shows 
that the force is acting against the direction of increasing displacement and back towards 
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Simple Harmonic Motion 
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(h) 




L ^~c =° 

« 2 = f- 

Lc 



Figure 1.1 Simple harmonic oscillators with their equations of motion and angular frequencies lo of 
oscillation, (a) A simple pendulum, (b) A torsional pendulum, (c) A mass on a frictionless plane 
connected by a spring to a walL. (d) A mass at the centre of a string under constant tension T. (e) A 
fixed length of non-viscous liquid in a U-tube of constant cross-section, (f) An acoustic Helmholtz 
resonator, (g) A hydrometer mass m in a liquid of density p. (h) An electrical L C resonant circuit 



the equilibrium position. A constant value of the stiffness restricts the displacement x to 
small values (this is Hooke’s Law of Elasticity). The stiffness .9 is obviously the restoring 
force per unit distance (or displacement) and has the dimensions 

force _ MLT~ 2 
distance L 

The equation of motion of such a disturbed system is given by the dynamic balance 
between the forces acting on the system, which by Newton’s Law is 

mass times acceleration = restoring force 



or 



nix = — sx 



where the acceleration 



x = 



d 2 x 

At 2 



This gives 



nix + sx = 0 
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or 



—x — 0 
m 



where the dimensions of 



5 MLT- 

— are 

m ML 



= T~ 2 = a 2 



Here T is a time, or period of oscillation, the reciprocal of v which is the frequency with 
which the system oscillates. 

However, when we solve the equation of motion we shall find that the behaviour of x 
with time has a sinusoidal or cosinusoidal dependence, and it will prove more appropriate 
to consider, not v, but the angular frequency u) — 2 tw so that the period 



T = 1 = 2tt 
v 



where s/m is now written as ur. Thus the equation of simple harmonic motion 



—x = 0 
m 



becomes 



x + lo 2 x = 0 



( 1 . 1 ) 



(Problem 1.1) 

Displacement in Simple Harmonic Motion 

The behaviour of a simple harmonic oscillator is expressed in terms of its displacement x 
from equilibrium, its velocity x, and its acceleration x at any given time. If we try the solution 

x = A cos cut 

where A is a constant with the same dimensions as x, we shall find that it satisfies the 
equation of motion 

x + lo 2 x = 0 



for 



x = — Auj sin Lot 



x = — Alo 2 coscut = —lu 2 x 



and 
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Another solution 



x — B sin Lot 

is equally valid, where B has the same dimensions as A, for then 

x = Boo cos c ot 



and 



x = —Blo 2 sin tut = — tu 2 x 

The complete or general solution of equation (1.1) is given by the addition or 
superposition of both values for x so we have 

x = A cos Lot + B sin tut (1-2) 



with 



x = — tu 2 (A cos tut + B sin tut) = — tu 2 x 

where A and B are determined by the values of x and x at a specified time. If we rewrite the 
constants as 



A = a sin and B = a cos cj) 
where (j) is a constant angle, then 

A 2 + B 2 = a 2 (sin 2 (j) + cos 2 cj)) — a 2 



so that 



a — V A 2 + B 2 

and 



x = a sin (f) cos tut + a cos q i sin tut 
= a sin (tut + 4>) 

The maximum value of sin (tut + cf>) is unity so the constant a is the maximum value of x, 
known as the amplitude of displacement. The limiting values of sin (tut + (j>) are ± 1 so the 
system will oscillate between the values of x = ±a and we shall see that the magnitude of a 
is determined by the total energy of the oscillator. 

The angle 0 is called the 'phase constant’ for the following reason. Simple harmonic 
motion is often introduced by reference to ‘circular motion' because each possible value of 
the displacement x can be represented by the projection of a radius vector of constant 
length a on the diameter of the circle traced by the tip of the vector as it rotates in a positive 




6 



Simple Harmonic Motion 




Figure 1.2 Sinusoidal displacement of simpLe harmonic oscillator with time, showing variation of 
starting point in cycle in terms of phase angle 4> 



anticlockwise direction with a constant angular velocity oj. Each rotation, as the radius 
vector sweeps through a phase angle of 2 tt rad, therefore corresponds to a complete 
vibration of the oscillator. In the solution 

x — a sin (c ot + cf>) 

the phase constant <f>, measured in radians, defines the position in the cycle of oscillation at 
the time t — 0, so that the position in the cycle from which the oscillator started to move is 



x = a sin <p 



The solution 



x — a sin tot 

defines the displacement only of that system which starts from the origin x = 0 at time 
t = 0 but the inclusion of 0 in the solution 

x — a sin (cot + </>) 

where 0 may take all values between zero and In allows the motion to be defined from any 
starting point in the cycle. This is illustrated in Figure 1.2 for various values of (j). 

(Problems 1.2, 1.3, 1.4, 1.5) 



Velocity and Acceleration in Simple Harmonic Motion 

The values of the velocity and acceleration in simple harmonic motion for 

x = a sin (t ot + cf>) 



are given by 



dx 

dr 



x = atucos (cat + (j>) 
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and 



d 2 x 

d H =X = 



—auj 2 sin (cot + 4>) 



The maximum value of the velocity aut is called the velocity amplitude and the 
acceleration amplitude is given by au> 2 . 

From Figure 1.2 we see that a positive phase angle of 7r/2 rad converts a sine into a 
cosine curve. Thus the velocity 

x = aco cos (t ot + q i) 

leads the displacement 



x = a sin(tut + (/>) 

by a phase angle of 7r/2 rad and its maxima and minima are always a quarter of a cycle 
ahead of those of the displacement; the velocity is a maximum when the displacement is 
zero and is zero at maximum displacement. The acceleration is ‘anti-phase’ (7r rad) with 
respect to the displacement, being maximum positive when the displacement is maximum 
negative and vice versa. These features are shown in Figure 1.3. 

Often, the relative displacement or motion between two oscillators having the same 
frequency and amplitude may be considered in terms of their phase difference (f>\ — (j >2 
which can have any value because one system may have started several cycles before the 
other and each complete cycle of vibration represents a change in the phase angle of 
(f) — 2i r. When the motions of the two systems are diametrically opposed; that is, one has 




Figure 1.3 Variation with time of displacement, veLocity and acceleration in simple harmonic 
motion. Displacement Lags veLocity by 7r/2 rad and is 7r rad out of phase with the acceleration. The 
initial phase constant 4 > is taken as zero 
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x=+a whilst the other is at x = —a, the systems are ‘anti-phase’ and the total phase 
difference 

<pi — <p 2 = nt trad 

where n is an odd integer. Identical systems ‘in phase’ have 

<j>i — <j>2 — 2«7rrad 

where n is any integer. They have exactly equal values of displacement, velocity and 
acceleration at any instant. 

(Problems 1.6, 1.7, 1.8, 1.9) 

Non-linearity 

If the stiffness s is constant, then the restoring force F = — sx, when plotted versus x, will 
produce a straight line and the system is said to be linear. The displacement of a linear 
simple harmonic motion system follows a sine or cosine behaviour. Non-linearity results 
when the stiffness s is not constant but varies with displacement x (see the beginning of 
Chapter 14). 

Energy of a Simple Harmonic Oscillator 

The fact that the velocity is zero at maximum displacement in simple harmonic motion and 
is a maximum at zero displacement illustrates the important concept of an exchange 
between kinetic and potential energy. In an ideal case the total energy remains constant but 
this is never realized in practice. If no energy is dissipated then all the potential energy 
becomes kinetic energy and vice versa, so that the values of (a) the total energy at any time, 
(b) the maximum potential energy and (c) the maximum kinetic energy will all be equal; 
that is 



E total = KE + PE = KE max = PE max 

The solution x = a sin (cot + <p) implies that the total energy remains constant because the 
amplitude of displacement x = ±a is regained every half cycle at the position of maximum 
potential energy; when energy is lost the amplitude gradually decays as we shall see later in 
Chapter 2. The potential energy is found by summing all the small elements of work sx. dx 
(force sx times distance dx) done by the system against the restoring force over the range 
zero to x where x = 0 gives zero potential energy. 

Thus the potential energy = 

sx ■ dx = \sx 2 

. 0 

The kinetic energy is given by 'wx 2 so that the total energy 

E = \mx 2 + \sx 2 
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Since E is constant we have 

AE . 

—— = ( mx + sx)x = 0 
At 

giving again the equation of motion 

mix + sx = 0 

The maximum potential energy occurs at x — Az a and is therefore 

PE max “ 9 



The maximum kinetic energy is 



KEmax = ({mx 2 ) max = \ma 2 cu 2 [cos 2 (cut + (f >)] 1 



1 ,2 



when the cosine factor is unity. 

But mcu 2 = s so the maximum values of the potential and kinetic energies are equal, 
showing that the energy exchange is complete. 

The total energy at any instant of time or value of x is 



E = \ mx 2 + \ sx 2 

= ^ma 2 uu 2 [cos 2 (cut +</>) + sin 2 (iut + (f>)\ 

1 2 2 
= 2 mau) 



as we should expect. 

Figure 1.4 shows the distribution of energy versus displacement for simple harmonic 
motion. Note that the potential energy curve 

PE = i sx 2 — \ nicrcu 2 sin 2 (u;t + f) 

is parabolic with respect to x and is symmetric about x = 0, so that energy is stored in the 
oscillator both when x is positive and when it is negative, e.g. a spring stores energy 
whether compressed or extended, as does a gas in compression or rarefaction. The kinetic 
energy curve 

KE = jmi 2 = \ma 2 u 2 cos 2 (cut + (j>) 

is parabolic with respect to both x and jc. The inversion of one curve with respect to the 
other displays the 7r/2 phase difference between the displacement (related to the potential 
energy) and the velocity (related to the kinetic energy). 

For any value of the displacement x the sum of the ordinates of both curves equals the 
total constant energy E. 
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Total energy E= KE + PE 




Figure 1.4 Parabolic representation of potential energy and kinetic energy of simpLe harmonic 
motion versus displacement. Inversion of one curve with respect to the other shows a 90° phase 
difference. At any displacement value the sum of the ordinates of the curves equals the total 
constant energy E 

(Problems 1.10, 1.11, 1.12) 



Simple Harmonic Oscillations in an Electrical System 

So far we have discussed the simple harmonic motion of the mechanical and fluid systems 
of Figure 1.1, chiefly in terms of the inertial mass stretching the weightless spring of 
stiffness .v. The stiffness 5 of a spring defines the difficulty of stretching; the reciprocal of 
the stiffness, the compliance C (where s — 1/C) defines the ease with which the spring is 
stretched and potential energy stored. This notation of compliance C is useful when 
discussing the simple harmonic oscillations of the electrical circuit of Figure 1.1(h) and 
Figure 1.5, where an inductance L is connected across the plates of a capacitance C. The 
force equation of the mechanical and fluid examples now becomes the voltage equation 



L 





Figure 1.5 Electrical system which oscillates simpLe harmonicaLLy. The sum of the voltages around 
the circuit is given by Kirchhoffs Law as Ldl/dt + q/C = 0 
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(balance of voltages) of the electrical circuit, but the form and solution of the equations and 
the oscillatory behaviour of the systems are identical. 

In the absence of resistance the energy of the electrical system remains constant and is 
exchanged between the magnetic field energy stored in the inductance and the electric field 
energy stored between the plates of the capacitance. At any instant, the voltage across the 
inductance is 



V = 





where 1 is the current flowing and q is the charge on the capacitor, the negative sign 
showing that the voltage opposes the increase of current. This equals the voltage q/C 
across the capacitance so that 



L'q + q/C = 0 



(Kirchhoff s Law) 



or 



q + t v 2 q = 0 

where 



The energy stored in the magnetic field or inductive part of the circuit throughout the 
cycle, as the current increases from 0 to /, is formed by integrating the power at any instant 
with respect to time; that is 



E L = 



VI dt 



(where V is the magnitude of the voltage across the inductance). 
So 



E L = 



V7dr = 



dl rI 
L — I dt = 
dt 



o 



= i L/ 2 = iL«7 2 



Lldl 



The potential energy stored mechanically by the spring is now stored electrostatically by 
the capacitance and equals 
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Comparison between the equations for the mechanical and electrical oscillators 



mechanical (force) — >■ nix + sx = 0 

q 

electrical (voltage) — > L'q -\ — = 0 



mechanical (energy) — »• \mx 2 + \sx 2 = E 

1 i 1 q 2 

electrical (energy) -Lq 2 + - — = E 



shows that magnetic field inertia (defined by the inductance L) controls the rate of change 
of current for a given voltage in a circuit in exactly the same way as the inertial mass 
controls the change of velocity for a given force. Magnetic inertial or inductive behaviour 
arises from the tendency of the magnetic flux threading a circuit to remain constant and 
reaction to any change in its value generates a voltage and hence a current which flows to 
oppose the change of flux. This is the physical basis of Fleming’s right-hand rule. 



Superposition of Two Simple Harmonic Vibrations in One 
Dimension 

(1) Vibrations Having Equal Frequencies 

In the following chapters we shall meet physical situations which involve the superposition 
of two or more simple harmonic vibrations on the same system. 

We have already seen how the displacement in simple harmonic motion may be 
represented in magnitude and phase by a constant length vector rotating in the positive 
(anticlockwise) sense with a constant angular velocity u>. To find the resulting motion of a 
system which moves in the x direction under the simultaneous effect of two simple 
harmonic oscillations of equal angular frequencies but of different amplitudes and phases, 
we can represent each simple harmonic motion by its appropriate vector and carry out a 
vector addition. 

If the displacement of the first motion is given by 

X\ = a \ cos (c ot + (j> i) 

and that of the second by 

x 2 = a 2 cos (ivt+ <f> 2 ) 

then Figure 1 .6 shows that the resulting displacement amplitude R is given by 

R 2 = (a i + a 2 cosS)~ + (a 2 sin<5) 2 
— a | T a^, T 2 a \ a 2 cos S 



where 6 = (j> 2 — <j>i is constant. 
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Figure 1.6 Addition of vectors, each representing simple harmonic motion along the x axis at 
angular frequency w to give a resulting simple harmonic motion displacement x = R cos (cut + 0) — 
here shown for t = 0 



The phase constant 6 of R is given by 

„ a i sin dq + U2 suu/>2 

tan 9 = — 

a i cos ip] + a 2 cos <p 2 

so the resulting simple harmonic motion has a displacement 

x = R cos (t at + 9 ) 

an oscillation of the same frequency uj but having an amplitude R and a phase constant 9 . 

(Problem 1.13) 

(2) Vibrations Having Different Frequencies 

Suppose we now consider what happens when two vibrations of equal amplitudes but 
different frequencies are superposed. If we express them as 

x\ = a since it 



and 



X2 — a since2t 



where 



Ct >2 > ce 1 
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then the resulting displacement is given by 



x = x\ + X 2 = a(sintuif + sincu 2 t) 



= 2 a sin 



[u)\ + (U 2 )f 
2 



cos 



(w 2 - Wi )t 
2 



This expression is illustrated in Figure 1.7. It represents a sinusoidal oscillation at the 
average frequency (uj \ + 2 ) / 2 having a displacement amplitude of 2 a which modulates; 
that is, varies between 2 a and zero under the influence of the cosine term of a much slower 
frequency equal to half the difference (cu 2 — uq)/2 between the original frequencies. 

When uj \ and uii are almost equal the sine term has a frequency very close to both uj \ 
and l c >2 whilst the cosine envelope modulates the amplitude 2 a at a frequency (cu 2 — u>\)/ 2 
which is very slow. 

Acoustically this growth and decay of the amplitude is registered as ‘beats’ of strong 
reinforcement when two sounds of almost equal frequency are heard. The frequency of the 
‘beats’ is (u; 2 — cu i), the difference between the separate frequencies (not half the 
difference) because the maximum amplitude of 2 a occurs twice in every period associated 
with the frequency (tu 2 — uj i)/ 2. We shall meet this situation again when we consider 
the coupling of two oscillators in Chapter 4 and the wave group of two components in 
Chapter 5. 




Figure 1.7 Superposition of two simpLe harmonic displacements Xy = a sin ujyt and x 2 = a sin ui 2 t 
when uj 2 >coy. The slow cos [(w 2 - w i) / 2]t envelope modulates the sin [(u> 2 + to y) / 2}t curve 
between the values x = ±2o 
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Superposition of Two Perpendicular Simple Harmonic 
Vibrations 

(1) Vibrations Having Equal Frequencies 

Suppose that a particle moves under the simultaneous influence of two simple harmonic 
vibrations of equal frequency, one along the x axis, the other along the perpendicular y axis. 
What is its subsequent motion? 

This displacements may be written 

x = a i sin [uut + <j>\) 
y = a2 sin (art + $2) 

and the path followed by the particle is formed by eliminating the time t from these 
equations to leave an expression involving only x and y and the constants (t> \ and fo. 
Expanding the arguments of the sines we have 

x . . , 

— = sin uit cos (p 1 + cos u>t sin <p 1 
a 1 



and 



a 2 



= sin cut cos cf>2 + cos cut sin <f > 2 



If we carry out the process 



this will yield 



( x ■ * y ■ V , ( y , * \ 

— sin 02 sin (pi + — cos (pi cosip2 

\«i «2 / \a 2 a 1 J 



X^ y- 2 xv 0 

-2 +^-2 - COS (02 - ^ 1 ) = sm 2 (4>2 - fl) 

a\ a 2 a\a,2 



( 1 . 3 ) 



which is the general equation for an ellipse. 

In the most general case the axes of the ellipse are inclined to the x and y axes, but these 
become the principal axes when the phase difference 



>2 — <Pi = 



Equation ( 1 . 3 ) then takes the familiar form 



- + ^=1 
2 ' 2 1 
a f 



that is, an ellipse with semi-axes a\ and a 2. 
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If a i = a 2 = a this becomes the circle 



2,2 2 

x + _y = a 



When 



<t>2 ~ <t> i = 0, 27T, 47 r, etc. 



the equation simplifies to 



a 2 

y = —x 
a i 



which is a straight line through the origin of slope a^l a 
Again for <j >2 — (j)\ = it, 37t, 57 t, etc., we obtain 



y = - 



a 2 
— x 



a i 



a straight line through the origin of equal but opposite slope. 

The paths traced out by the particle for various values of 6 — 4>2 — 4>i are shown in 
Figure 1.8 and are most easily demonstrated on a cathode ray oscilloscope. 

When 



^2 — <t>i = 0, 7r, 27 r, etc. 

and the ellipse degenerates into a straight line, the resulting vibration lies wholly in one 
plane and the oscillations are said to be plane polarized. 




+ 



3 



'(/) 

CU 

II 



S = 0 8 = -5 8=| S = 5T 5 = tc 




x= a sin (tof + 0 1 ) 02-0, =8 



Figure 1.8 Paths traced by a system vibrating simultaneously in two perpendicular directions with 
simple harmonic motions of equal frequency. The phase angle 6 is the angle by which they motion 
leads the x motion 
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Convention defines the plane of polarization as that plane perpendicular to the plane 
containing the vibrations. Similarly the other values of 



yield circular or elliptic polarization where the tip of the vector resultant traces out the 
appropriate conic section. 

(Problems 1.14, 1.15, 1.16) 

* Polarization 

Polarization is a fundamental topic in optics and arises from the superposition of two 
perpendicular simple harmonic optical vibrations. We shall see in Chapter 8 that when a 
light wave is plane polarized its electrical field oscillation lies within a single plane and 
traces a sinusoidal curve along the direction of wave motion. Substances such as quartz and 
calcite are capable of splitting light into two waves whose planes of polarization are 
perpendicular to each other. Except in a specified direction, known as the optic axis, these 
waves have different velocities. One wave, the ordinary or O wave, travels at the same 
velocity in all directions and its electric field vibrations are always perpendicular to the 
optic axis. The extraordinary or E wave has a velocity which is direction-dependent. Both 
ordinary and extraordinary light have their own refractive indices, and thus quartz and 
calcite are known as doubly refracting materials. When the ordinary light is faster, as in 
quartz, a crystal of the substance is defined as positive, but in calcite the extraordinary light 
is faster and its crystal is negative. The surfaces, spheres and ellipsoids, which are the loci 
of the values of the wave velocities in any direction are shown in Figure 1.9(a), and for a 



Optic axis Optic axis 





Figure 1.9a Ordinary (spherical) and extraordinary (elLiposoidal) wave surfaces in doubLy refracting 
calcite and quartz. In calcite the E wave is faster than the 0 wave, except aLong the optic axis. In 
quartz the 0 wave is faster. The 0 vibrations are aLways perpendicular to the optic axis, and the 0 and 
E vibrations are aLways tangential to their wave surfaces 



*This section may be omitted at a first reading. 
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Plane polarized 
light normally 
incident 



Calcite 

crystal 




O vibration 
± to plane of paper 



E vibration 



Optic 

axis 



Figure 1.9b Plane polarized Light normally incident on a calcite crystal face cut parallel to its optic 
axis. The advance of the E wave over the 0 wave is equivalent to a gain in phase 



given direction the electric field vibrations of the separate waves are tangential to the 
surface of the sphere or ellipsoid as shown. Figure 1.9(b) shows plane polarized light 
normally incident on a calcite crystal cut parallel to its optic axis. Within the crystal the 
faster E wave has vibrations parallel to the optic axis, while the O wave vibrations are 
perpendicular to the plane of the paper. The velocity difference results in a phase gain of 
the E vibration over the O vibration which increases with the thickness of the crystal. 
Figure 1.9(c) shows plane polarized light normally incident on the crystal of Figure 1.9(b) 
with its vibration at an angle of 45° of the optic axis. The crystal splits the vibration into 




resulting electric 
field vector 

Figure 1.9c The crystal of Fig. 1.9c is thick enough to produce a phase gain of 7r/2 rad in the 
E wave over the 0 wave. Wave recombination on leaving the crystal produces circularly polarized 
light 
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equal E and O components, and for a given thickness the E wave emerges with a phase gain 
of 90° over the O component. Recombination of the two vibrations produces circularly 
polarized light, of which the electric field vector now traces a helix in the anticlockwise 
direction as shown. 



(2) Vibrations Having Different Frequencies ( Lissajous Figures ) 

When the frequencies of the two perpendicular simple harmonic vibrations are not equal 
the resulting motion becomes more complicated. The patterns which are traced are called 
Lissajous figures and examples of these are shown in Figure 1.10 where the axial 
frequencies bear the simple ratios shown and 

6 — <j >2 — 4>\ =0 (on the left) 

7T 

= — (on the right) 

If the amplitudes of the vibrations are respectively a and b the resulting Lissajous figure 
will always be contained within the rectangle of sides 2 a and 2b. The sides of the rectangle 
will be tangential to the curve at a number of points and the ratio of the numbers of these 
tangential points along the x axis to those along the y axis is the inverse of the ratio of the 
corresponding frequencies (as indicated in Figure 1.10). 



8 = — 







Figure 1.10 Simple Lissajous figures produced by perpendicular simple harmonic motions of 
different angular frequencies 
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Superposition of a Large Number a? of Simple Harmonic Vibrations 
of Equal Amplitude a and Equal Successive Phase Difference 8 

Figure 1.11 shows the addition of n vectors of equal length a, each representing a simple 
harmonic vibration with a constant phase difference 6 from its neighbour. Two general 
physical situations are characterized by such a superposition. The first is met in Chapter 5 
as a wave group problem where the phase difference 6 arises from a small frequency 
difference, 6cu, between consecutive components. The second appears in Chapter 12 where 
the intensity of optical interference and diffraction patterns are considered. There, the 
superposed harmonic vibrations will have the same frequency but each component will have 
a constant phase difference from its neighbour because of the extra distance it has travelled. 

The figure displays the mathematical expression 

R cos (cut + a) = a cos cut + a cos (cut + S) + a cos (cut + 26) 

+ •••+« cos (cut + [n — 1]<5) 




Figure 1.11 Vector superposition of a large number n of simple harmonic vibrations of equal 
amplitude a and equal successive phase difference 6. The amplitude of the resultant 



_ „ . nS 

R = 2rsin — = a 
2 



si nn6/2 
sin 6/2 



and its phase with respect to the first contribution is given by 

a = (n — 1)6/2 
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where R is the magnitude of the resultant and a is its phase difference with respect to the 
first component a cos c ot. 

Geometrically we see that each length 



where r is the radius of the circle enclosing the (incomplete) polygon. 
From the isosceles triangle OAC the magnitude of the resultant 



R = 2 r sin 



116 sin n6 /2 

2 sin 6/2 



and its phase angle is seen to be 

a = OAB - OAC 



In the isosceles triangle OAC 

nS 

OAC = 90 ° - — 
2 



and in the isosceles triangle OAB 



so 



OAB = 90 ° 



6 

2 





( n 



1 ) 



6 

2 



that is, half the phase difference between the first and the last contributions. Hence the 
resultant 



sin n8/2 

R cos (cot + a) — a — — — — cos 
v ’ sin 6/2 

We shall obtain the same result later in this chapter as an example on the use of exponential 
notation. 

For the moment let us examine the behaviour of the magnitude of the resultant 

sinn< 5/2 

R = a ■ x/-> 
sin 0/2 

which is not constant but depends on the value of 6. When n is very large 6 is very small 
and the polygon becomes an arc of the circle centre O , of length na = A, with R as the 
chord. Then 



iot+(n - 1 )- 



/ u _ nS 
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Figure 1.12 (a) Graph of A sin a/a versus a, showing the magnitude of the resultants for (b) 

a = 0; (c) a = 7 r / 2 ; (d) a = tv and (e) a = 3tt/2 



and 




6 

2 



a 

n 



Hence, in this limit, 



smn6/2 sincr sin a A sin a 

R — a — — — — = a — — = na = 

sinn/2 a/n a a 

The behaviour of A sin a/ a versus a is shown in Figure 1.12. The pattern is symmetric 
about the value a = 0 and is zero whenever sin a = 0 except at a — > 0 that is, when sin 
a/ a — > 1. When a = 0, 6 = 0 and the resultant of the n vectors is the straight line of length 
A, Figure 1.12(b). As 6 increases A becomes the arc of a circle until at a = ir/2 the first and 
last contributions are out of phase (2a = 7 r) and the arc A has become a semicircle of 
which the diameter is the resultant R Figure 1.12(c). A further increase in 6 increases a and 
curls the constant length A into the circumference of a circle (a = n) with a zero resultant. 
Figure 1.12(d). At a = 37t/2, Figure 1.12(e) the length A is now 3/2 times the 
circumference of a circle whose diameter is the amplitude of the first minimum. 

*Superposition of n Equal SHM Vectors of Length a with 
Random Phase 

When the phase difference between the successive vectors of the last section may take 
random values </> between zero and 27 t (measured from the x axis) the vector superposition 
and resultant R may be represented by Figure 1.13. 

*This section may be omitted at a first reading. 
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is important in optical incoherence and in energy loss from waves from random dissipation processes 



The components of R on the x and y axes are given by 

R x = a cos <f> i + a cos <p 2 + a cos ^3 ... a cos (f>„ 

n 

= a cos fi 

i= 1 

and 



n 

R y = a sinfj 

i= 1 



where 



R 2 



Rl+R 



2 

y 



Now 



/ n ' 


2 

\ 


n n 


n 


R^ = a 2 ( cos fj 

V i=l y 


= « 2 


cos 2 ft + 

i= 1 1 

L m 


COS ^53 COS (f)j 
1=1 



In the typical term 2 cos <b, cos of the double summation, cos ci>, and cos <pj have random 
values between ± 1 and the averaged sum of sets of these products is effectively zero. 
The summation 



n 

cos 2 fi = n cos 2 f 

i= 1 
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that is, the number of terms n times the average value cos 2 <j) which is the integrated value 
of cos 2 cf> over the interval zero to 2-7T divided by the total interval 27 t, or 



cos 2 d> = 



1 

2ir 



2tt 



i 



cos 2 (f>d(/) = — = sin 2 < 



So 



n 

R 2 = a 2 cos 2 cf>i = na 2 cos 2 
i=\ 



2 

no 

~Y 



and 



9 9 V > . 9 9.9. AtCt 

7 ? y = a 9, sin (j)i — na sin 

1=1 2 

giving 

7 ? 2 = 7? 2 + 7? 2 = n< 3 2 
or 

7? = V^ 7 

Thus, the amplitude 7 ? of a system subjected to n equal simple harmonic motions of 
amplitude a with random phases in only \fna whereas, if the motions were all in phase R 
would equal na. 

Such a result illustrates a very important principle of random behaviour. 

(Problem 1.17) 

Applications 

Incoherent Sources in Optics The result above is directly applicable to the problem of 
coherence in optics. Light sources which are in phase are said to be coherent and this 
condition is essential for producing optical interference effects experimentally. If the 
amplitude of a light source is given by the quantity a its intensity is proportional to a 2 , n 
coherent sources have a resulting amplitude na and a total intensity n 2 a 2 . Incoherent 
sources have random phases, n such sources each of amplitude a have a resulting amplitude 
y/na and a total intensity of na 2 . 

Random Processes and Energy Absorption From our present point of view the 
importance of random behaviour is the contribution it makes to energy loss or absorption 
from waves moving through a medium. We shall meet this in all the waves we discuss. 
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Random processes, for example collisions between particles, in Brownian motion, are of 
great significance in physics. Diffusion, viscosity or frictional resistance and thermal 
conductivity are all the result of random collision processes. These energy dissipating 
phenomena represent the transport of mass, momentum and energy, and change only in the 
direction of increasing disorder. They are known as ‘thermodynamically irreversible’ 
processes and are associated with the increase of entropy. Heat, for example, can flow only 
from a body at a higher temperature to one at a lower temperature. Using the earlier 
analysis where the length a is no longer a simple harmonic amplitude but is now the 
average distance a particle travels between random collisions (its mean free path), we see 
that after n such collisions (with, on average, equal time intervals between collisions) the 
particle will, on average, have travelled only a distance sffia from its position at time t — 0, 
so that the distance travelled varies only with the square root of the time elapsed instead of 
being directly proportional to it. This is a feature of all random processes. 

Not all the particles of the system will have travelled a distance sjna but this distance is 
the most probable and represents a statistical average. 

Random behaviour is described by the diffusion equation (see the last section of 
Chapter 7) and a constant coefficient called the diffusivity of the process will always 
arise. The dimensions of a diffusivity are always length 2 /time and must be interpreted in 
terms of a characteristic distance of the process which varies only with the square root of 
time. 



Some Useful Mathematics 

The Exponential Senes 

By a ‘natural process’ of growth or decay we mean a process in which a quantity changes 
by a constant fraction of itself in a given interval of space or time. A 5% per annum 
compound interest represents a natural growth law; attenuation processes in physics usually 
describe natural decay. 

The law is expressed differentially as 

dN d N , , 

— = ±adi or — = ± a at 
N N 



where N is the changing quantity, a is a constant and the positive and negative signs 
represent growth and decay respectively. The derivatives dN/dx or dN/dt are therefore 
proportional to the value of N at which the derivative is measured. 

Integration yields N = N oe ±ax or N = N oe ±ot where No is the value at x or t — 0 and e 
is the exponential or the base of natural logarithms. The exponential series is defined as 



2 3 n 

v , X X D X 

6 =1+X+ 2! + 3! + "' + ^! + 



and is shown graphically for positive and negative x in Figure 1.14. It is important to note 
that whatever the form of the index of the logarithmic base e, it is the power to which the 
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Figure 1.14 The behaviour of the exponential series y = e x and y = e x 



base is raised, and is therefore always non-dimensional. Thus e ax is non-dimensional and a 
must have the dimensions of x' 1 . Writing 



e“ r = 1 + ox - 



(ax) 2 (ax ) 3 



2 ! 



3! 



it follows immediately that 
d 



, 2a 2 3a 3 2 

d * (e ' ) - a+ ^r* + ^r x ~ + 



= a 



1 + (XX H- 



(ax) 2 (ax) 3N 



2 ! 



3! 



= ae 



Similarly 



d 2 



2 e ax 



In Chapter 2 we shall use d(e Qf )/d t = a e at and d 2 (e“')/df 2 = a 2 e at on a number of 
occasions. 

By taking logarithms it is easily shown that e x e y = e x+y since log e (e A e v ) = 
loge e x + loge e v = x + y. 



The Notation i = \[^\ 

The combination of the exponential series with the complex number notation i = y/—l is 
particularly convenient in physics. Here we shall show the mathematical convenience in 
expressing sine or cosine (oscillatory) behaviour in the form e“ = cosx + isinx. 
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In Chapter 3 we shall see the additional merit of i in its role of vector operator. 
The series representation of sin x is written 



sinr — x — 



3! 



5! 7! 



and that of cos x is 



cos* = 1 



X X 

2[ + 4[ 



x 



6 



6 ! 



Since 



etc. we have 



i = \/— I, i 2 = — 1 , i 3 = -i 



= 1 
= 1 



IX 



(be ) 2 (lt) 3 (be ) 4 



2 ! 

ci 



3! 



X" IX 

U “ 2! “ DT 



x 

' 4[ 



, x 2 x 4 
- 1_ 2! + 4!' 



x 



4! 



X J 

5! 



+ 



= cosx + isinx 



We also see that 

— (e“) = ie LT = icosx — sinx 
dx 

Often we shall represent a sine or cosine oscillation by the form e LX and recover the original 
form by taking that part of the solution preceded by i in the case of the sine, and the real 
part of the solution in the case of the cosine. 



Examples 

(1 ) In simple harmonic motion (x + uj 2 x = 0) let us try the solution x = a e ,UJI e l4 ', where a 
is a constant length, and f (and therefore is a constant. 



dx 
d t 
d 2 x 
dr 2 



= x = i u>a e lut = it ox 



= x = i 2 ca 2 ae luir e lcl> — —u> 2 x 



Therefore 



r=fle i “'e i ^ = fle i ^ 

= a cos (cot + <p) + i a sin (cot + f) 



is a complete solution of x + u; 2 x = 0. 
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On p. 6 we used the sine form of the solution; the cosine form is equally valid and merely 
involves an advance of 7r/2 in the phase <b. 

(2) 



= 2 





e — e 





= 2cosx 
= 2isinx 



(3) On p. 21 we used a geometrical method to show that the resultant of the superposed 
harmonic vibrations 



a cos wt + a cos (cut + 6) + a cos (cut + 28) + 
sin;z6/2 ( fn — l x 



+ a cos (cut + [n — 1]6) 



sin 8/2 



cos < cut + 



We can derive the same result using the complex exponential notation and taking the real 
part of the series expressed as the geometrical progression 



a e ia>t + a e i(w?+,5) + a e i(wf+25) + • ■ ■ + a e i[ ^ +(n “ 1)51 
= ae iwr (l+z + z 2 + ---+z("“ 1) ) 



where z = e lS - 
Writing 



and 



we have 



So 



S(z) = l+z + z 2 + ---+z n - 1 



z[S(z)] — z + z} + • • • + z" 




1 - e inS 
1 - e' 6 



ae iw, S(z ) 



a e 



1 - e i,,s 
1 - e w 



at 



^m8/2^—\n8/2 ^m8/2^ 

Q i8/2^-i8/2 _ e i<5/2) 



sin, "V 2 

sin 8/2 
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with the real part 



= a cos 



, n - 1 

tot + ( — — — ) 6 



sinub/2 
sin 6/2 



which recovers the original cosine term from the complex exponential notation. 

(Problem 1.18) 

(4) Suppose we represent a harmonic oscillation by the complex exponential form 

z = ae lut 

where a is the amplitude. Replacing i by — i defines the complex conjugate 



z = ae 



The use of this conjugate is discussed more fully in Chapter 3 but here we can note that the 
product of a complex quantity and its conjugate is always equal to the square of the 
amplitude for 



zz* =a 2 e iwf e- iw, = a 2 e (i - i)wf = a 2 e° 



= a~ 



(Problem 1.19) 



Problem 1.1 

The equation of motion 



.. . . t u 

mx — —sx with lo = — 
m 

applies directly to the system in Figure 1.1(c). 

If the pendulum bob of Figure 1.1(a) is displaced a small distance x show that the stiffness (restoring 
force per unit distance) is mg/l and that w 1 = g/I where g is the acceleration due to gravity. Now use 
the small angular displacement 6 instead of x and show that ui is the same. 

In Figure 1.1(b) the angular oscillations are rotational so the mass is replaced by the moment of 
inertia I of the disc and the stiffness by the restoring couple of the wire which is C rad~' of angular 
displacement. Show that w 2 = C/I. 

In Figure 1.1(d) show that the stiffness is IT /l and that uj 2 = 2 T /1m. 

In Figure 1.1(e) show that the stiffness of the system in 2 pAg, where A is the area of cross section 
and that u> 2 = 2 g/l where g is the acceleration due to gravity. 
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In Figure 1.1(f) only the gas in the flask neck oscillates, behaving as a piston of mass pAl. If the 
pressure changes are calculated from the equation of state use the adiabatic relation pV 1 = constant 
and take logarithms to show that the pressure change in the flask is 

dV Ax 

dp = -7p— = - 7 p—, 

where x is the gas displacement in the neck. Hence show that u 2 = 'ypA/lpV. Note that 7 p is the 
stiffness of a gas (see Chapter 6 ). 

In Figure 1.1(g), if the cross-sectional area of the neck is A and the hydrometer is a distances above 
its normal floating level, the restoring force depends on the volume of liquid displaced (Archimedes’ 
principle). Show that u 2 = gpA/m. 

Check the dimensions of to 2 for each case. 

Problem 1.2 

Show by the choice of appropriate values for A and B in equation (1.2) that equally valid solutions 
for x are 



x = a cos {ut + (f > ) 
x = a sin {ut — cp) 
x = a cos {ut — (f>) 

and check that these solutions satisfy the equation 

x + u 2 x = 0 



Problem 1.3 

The pendulum in Figure 1.1(a) swings with a displacement amplitude a. If its starting point from rest 
is 



(a) x = a 

(b) x = —a 

find the different values of the phase constant cp for the solutions 

x = a sin (cot + cp) 
x = a cos {uit + (p ) 
x = a sin ( ut — cp) 
x = a cos {ut — (p) 

For each of the different values of cp, find the values of ut at which the pendulum swings through the 
positions 

x = +a/ \/2 
x = a/ 2 
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and 



x = Q 



for the first time after release from 



x = ±fl 



Problem 1.4 

When the electron in a hydrogen atom bound to the nucleus moves a small distance from its 
equilibrium position, a restoring force per unit distance is given by 

s = e 2 /4neor 2 

where r = 0.05 nm may be taken as the radius of the atom. Show that the electron can oscillate with 
a simple harmonic motion with 

uiq « 4.5 x 10~ 16 rads _1 

If the electron is forced to vibrate at this frequency, in which region of the electromagnetic spectrum 
would its radiation be found? 

e = 1.6 x 10~ 19 C, electron mass m e = 9.1 x 10~ 31 kg 
cq = 8.85 x 10~ 12 N _l m -2 C 2 



Problem 1.5 

Show that the values of w 2 for the three simple harmonic oscillations (a), (b), (c) in the diagram are 
in the ratio 1:2:4. 



(a) (b) (c) 




Problem 1.6 

The displacement of a simple harmonic oscillator is given by 

x = a sin (ujt + f) 

If the oscillation started at time t = 0 from a position xq with a velocity x = vq show that 

tanf = loxo/vq 



' — (x Q + Vq/u) 



2x1/2 



and 
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Problem 1.7 

A particle oscillates with simple harmonic motion along the x axis with a displacement amplitude a 
and spends a time dr in moving from x to x + dr. Show that the probability of finding it between x 
and x + dx is given by 



dx 

7r (a 2 — x 2 ) 

(in wave mechanics such a probability is not zero for x > a). 

Problem. 1.8 

Many identical simple harmonic oscillators are equally spaced along the x axis of a medium and a 
photograph shows that the locus of their displacements in the y direction is a sine curve. If the 
distance A separates oscillators which differ in phase by 2-7T radians, what is the phase difference 
between two oscillators a distance x apart? 

Problem 1.9 

A mass stands on a platform which vibrates simple harmonically in a vertical direction at a 
frequency of 5 Hz. Show that the mass loses contact with the platform when the displacement 
exceeds 10~ 2 m. 



Problem 1.10 

A mass M is suspended at the end of a spring of length / and stiffness s. If the mass of the spring is m 
and the velocity of an element dy of its length is proportional to its distance y from the fixed end of 
the spring, show that the kinetic energy of this element is 



1 

2 




2 



where v is the velocity of the suspended mass M. Hence, by integrating over the length of the spring, 
show that its total kinetic energy is ^mv 2 and, from the total energy of the oscillating system, show 
that the frequency of oscillation is given by 



LO 



2 



S 

M + m/3 



Problem 1.11 

The general form for the energy of a simple harmonic oscillator is 

E = ^mass (velocity) 2 + | stiffness (displacement) 2 

Set up the energy equations for the oscillators in Figure 1.1(a), (b), (c), (d), (e), (f) and (g), and use 
the expression 




to derive the equation of motion in each case. 
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Problem 1.12 

The displacement of a simple harmonic oscillator is given by x = a sin cut. If the values of the 
displacement x and the velocity x are plotted on perpendicular axes, eliminate t to show that the locus 
of the points ( x , x) is an ellipse. Show that this ellipse represents a path of constant energy. 

Problem 1.13 

In Chapter 12 the intensity of the pattern when light from two slits interferes (Young’s experiment) 
will be seen to depend on the superposition of two simple harmonic oscillations of equal amplitude a 
and phase difference S. Show that the intensity 

I = R 2 oc 4a 2 cos 2 6/2 

Between what values does the intensity vary? 

Problem 1.14 

Carry out the process indicated in the text to derive equation (1.3) on p. 15. 

Problem 1.15 

The co-ordinates of the displacement of a particle of mass m are given by 

x = a sin tot 
y = b cos cut 

Eliminate t to show that the particle follows an elliptical path and show by adding its kinetic and 
potential energy at any position x, y that the ellipse is a path of constant energy equal to the sum of 
the separate energies of the simple harmonic vibrations. 

Prove that the quantity m(xy — yx) is also constant. What does this quantity represent? 

Problem 1.16 

Two simple harmonic motions of the same frequency vibrate in directions perpendicular to each 
other along the x and y axes. A phase difference 

6 = (f> 2 ~ 4>i 

exists between them such that the principal axes of the resulting elliptical trace are inclined at an 
angle to the x and y axes. Show that the measurement of two separate values of x (or y) is sufficient to 
determine the phase difference. 

(Hint: use equation (1.3) and measure v(max), and y for (x = 0.) 

Problem 1.17 

Take a random group of n > 7 values of f in the range 0 < (j> < n and form the product 

n n 

COS (j)i COS (j)j 

1=1 7=1 

¥/ 

Show that the average value obtained for several such groups is negligible with respect to nil. 
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Problem 1.18 

Use the method of example (3) (p. 28) to show that 



a sinwt + a sin (cut + 5) + a sin (cut + 25) + • ■ • + a sin [tot + (n — 1)5] 



= a sin 



UJt + 



0 ~ i) 
2 



8 



sinrc5/2 
sin 5/2 



Problem 1.19 

If we represent the sum of the series 

acosuit + a cos (cot + 5) + a cos (cut + 25) + • • • + a cos [ujt + (n — 1)5] 
by the complex exponential form 

z = Ae iw '(l + e w + e iM + • • • + e^- 1 ^) 



show that 



zz* 



2 sin 2 n6/2 
sin 2 5/2 



Summary of Important Results 

Simple Harmonic Oscillator (mass m, stiffness 5, amplitude a) 
Equation of motion x + lo 2 x — 0 where lj 2 = s/m 
Displacement x = a sin (cot + 0) 

Energy = /nix 2 + \sx 2 = / mw 2 a 2 = ^ sa 2 = constant 



Superposition (Amplitude and Phase ) of two SHMs 
One-dimensional 

Equal uj, different amplitudes, phase difference 6, resultant R where R 2 = a\ + a\+ 
2 a \U 2 cos 6 

Different oj, equal amplitude. 



x = xi + X 2 = a(sincuif + sin oj 2 t) 



. (uJi+lV2)t (u)2 — U)\)t 
= 2 a sin cos 



Two-dimensional: perpendicular axes 

Equal ui, different amplitude — giving general conic section 

x 2 y 2 2xy , , , . . 2/ , , . 

— + S’ — cos (0 2 = sm (02 - 0 1 ) 

a\ a 2 a\a 2 
(basis of optical polarization) 
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Superposition of n SHM Vectors (equal amplitude a , constant successive phase difference 8) 
The resultant is R cos (cut + a), where 

sinrc<5/2 

R = a ■ x/-> 
sin 0/2 



and 



a = (n — 1)8/2 

Important in optical diffraction and wave groups of many components. 
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Damped Simple Harmonic Motion 



Initially we discussed the case of ideal simple harmonic motion where the total energy 
remained constant and the displacement followed a sine curve, apparently for an infinite 
time. In practice some energy is always dissipated by a resistive or viscous process; for 
example, the amplitude of a freely swinging pendulum will always decay with time as 
energy is lost. The presence of resistance to motion means that another force is active, 
which is taken as being proportional to the velocity. The frictional force acts in the 
direction opposite to that of the velocity (see Figure 2.1) and so Newton’s Second law 
becomes 



nix = — sx — rk 



where r is the constant of proportionality and has the dimensions of force per unit of 
velocity. The presence of such a term will always result in energy loss. 

The problem now is to find the behaviour of the displacement x from the equation 

nix + rx + sx — 0 (2-1) 



where the coefficients m, r and s are constant. 

When these coefficients are constant a solution of the form x = Ce“ r can always be 
found. Obviously, since an exponential term is always nondimensional, C has the 
dimensions of x (a length, say) and a has the dimensions of inverse time, T 1 . We shall 
see that there are three possible forms of this solution, each describing a different 
behaviour of the displacement x with time. In two of these solutions C appears explicitly as 
a constant length, but in the third case it takes the form 

C = A + Bt* 



* The number of constants allowed in the general solution of a differential equation is always equal 
to the order (that is, the highest differential coefficient) of the equation. The two values A and B are 
allowed because equation (2.1) is second order. The values of the constants are adjusted to satisfy the 
initial conditions. 
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x 




Figure 2.1 Simple harmonic motion system with a damping or frictional force rx acting against the 
direction of motion. The equation of motion is mx + rx + sx = 0 



where A is a length, B is a velocity and t is time, giving C the overall dimensions of a 
length, as we expect. From our point of view this case is not the most important. 

Taking C as a constant length gives x = «C e and x = cn 2 Ce at , so that equation (2.1) 
may be rewritten 



C e at (mct 2 + ra + s) — 0 



so that either 



x = C e at — 0 (which is trivial) 



or 



ma 2 + ra + s = 0 

Solving the quadratic equation in ol gives 



— r / r 2 s 

2m V 4m 2 m 

Note that r /2m and (s/m) 1 /2 , and therefore, a, all have the dimensions of inverse time, 
T~\ which we expect from the form of e“'. 

The displacement can now be expressed as 

-n/2m \-(r 2 /4m 2 -s/tn) l/2 / / ~ r!/2rn -(r 2 f4m 2 -s/rn ) l/2 / 

or the sum of both these terms 



—rt/2m+(r 2 /4m 2 —s/m) l / 2 t , —rt/lm—{r 2 /Am 2 —s/m) { ^ 2 t 

Ji ■ — Ji 2 I yi 2 — 0- J C | l-' 2 O 

The bracket (r 2 /4m 2 — s/m) can be positive, zero or negative depending on the relative 
magnitude of the two terms inside it. Each of these conditions gives one of the three 
possible solutions referred to earlier and each solution describes a particular kind of 
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behaviour. We shall discuss these solutions in order of increasing significance from our 
point of view; the third solution is the one we shall concentrate upon throughout the rest of 
this book. 

The conditions are: 

( 1 ) Bracket positive ( r 2 /Am 2 > s/m). Here the damping resistance term r 1 /Am 2 
dominates the stiffness term s/m, and heavy damping results in a dead beat system. 

(2) Bracket zero (r 2 /Am 2 — s/m). The balance between the two terms results in a 
critically damped system. 

Neither (1) nor (2) gives oscillatory behaviour. 

(3) Bracket negative (r 2 /Am 2 < s/m). The system is lightly damped and gives oscillatory 
damped simple harmonic motion. 

Case 1. Heavy Damping 

Writing r/2m — p and (r 2 /Am 2 — s/m) 1 ,/2 = q , we can replace 

^ i ^—rt/2m+(r 2 /4m 2 —s/m) l ^ 2 t _j_ ^ - ^ ^ - rt/2m -(r 2 /4m 2 -s/m) 1/2 1 



by 



x — e~ pt (Ci e qt + C 2 e _?f ), 

where the C i and Ci are arbitrary in value but have the same dimensions as C (note that 
two separate values of C are allowed because the differential equation (2.1) is second 
order). 

If now F = Ci + Ci and G = C\ — Co, the displacement is given by 






e~ qt ) 



or 



* = e P \F cosh qt + Gsinh qt) 

This represents non-oscillatory behaviour, but the actual displacement will depend upon 
the initial (or boundary) conditions; that is, the value of x at time t — 0. If x = 0 at t = 0 
then F = 0, and 



x = 



G e -rt/ 2m sinh 





1/2 



t 



Figure 2.2 illustrates such behaviour when a heavily damped system is disturbed from 
equilibrium by a sudden impulse (that is, given a velocity at t — 0). It will return to zero 
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Figure 2.2 Non-osciLlatory behaviour of damped simple harmonic system with heavy damping 
(where r 2 /4m 2 > s/m) after the system has been given an impulse from a rest position x = 0 



displacement quite slowly without oscillating about its equilibrium position. More 
advanced mathematics shows that the value of the velocity dx/dt vanishes only once so that 
there is only one value of maximum displacement. 



(Problem 2.1) 

Case 2. Critical Damping (r 2 /4m 2 = s/m) 

Using the notation of Case 1, we see that q — 0 and that x = e~ p '(C\ + C 2 ). This is, in 
fact, the limiting case of the behaviour of Case I as q changes from positive to negative. In 
this case the quadratic equation in a has equal roots, which, in a differential equation 
solution, demands that C must be written C = A + Bt, where A is a constant length and B a 
given velocity which depends on the boundary conditions. It is easily verified that the value 

x = (A + Bt)e~ rt/2m = (A + Bt)e~ pt 

satisfies nix + rx + sx = 0 when r 2 /Am 2 — s/m. 



(Problem 2.2) 

Application to a Damped Mechanical Oscillator 

Critical damping is of practical importance in mechanical oscillators which experience 
sudden impulses and are required to return to zero displacement in the minimum time. 
Suppose such a system has zero displacement at t — 0 and receives an impulse which gives 
it an initial velocity V 
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Then x = 0 (so that A — 0) and x = V at t = 0. However, 

x = B[(—pt)e~ pl + e~ pt ] = B at t = 0 

so that B = V and the complete solution is 



x = Vte~ pt 



The maximum displacement x occurs when the system comes to rest before returning to 
zero displacement. At maximum displacement 



x = Vt~ pt {\ -pt) = 0 

thus giving (1 — pi) = 0, i.e. t = 1 jp. 

At this time the displacement is therefore 



V 

x—Vte p> — — e 
P 



l 



= 0.368- = 0.368 
P 



2 mV 
r 



The curve of displacement versus time is shown in Figure 2.3; the return to zero in a 
critically damped system is reached in minimum time. 



Case 3. Damped Simple Harmonic Motion 

When r 2 /Am 2 < s/m the damping is light, and this gives from the present point of view the 
most important kind of behaviour, oscillatory damped simple harmonic motion. 



Displacement 




Figure 2.3 Limiting case of non-oscillatory behaviour of damped simple harmonic system where 
r 2 /4m 2 = s/m (critical damping) 
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The expression (r 2 /Am 2 — s/m) 1 ^ 2 is an imaginary quantity, the square root of a 
negative number, which can be rewritten 



± 



f r 2 s 

V ■ 4m 2 



1/2 



= wrr(*-T, 



\m 4m 2 J 
„2 \ 1/2 



1/2 



= ±i( — j 

v m 4m z 



(where i = v / — T) 



so the displacement 

^ rl /2m ^+i(s/m—r 2 /4m 2 ) 1 ^ 2 t ^ ^ - rl /2m \(s/m— r 2 /4m 2 ) l ^ 2 t 

The bracket has the dimensions of inverse time; that is, of frequency, and can be written 
(s/m — r 2 /4m 2 ) 1 / 2 = to ' , so that the second exponential becomes e lu '‘ — coso/f+ 
isin uj't. This shows that the behaviour of the displacement x is oscillatory with a new 
frequency u/ < u> = (s/m ) 1 , the frequency of ideal simple harmonic motion. To compare 
the behaviour of the damped oscillator with the ideal case we should like to express the 
solution in a form similar to x = A sin (uj't + ()>) as in the ideal case, where c o has been 
replaced by lu'. 

We can do this by writing 



If we now choose 



x=Q -rt/2m 



(c ie iw '' + 



C 2 e* iwr ) 



c '=i eH 



and 



C 2 = 



A 

2i 



e - * 



where A and </> (and thus e 1 - 6 ) are constants which depend on the motion at t — 0, we find 
after substitution 



_ A -n/ 2m [e i( ^ ) -e- i ^)] 



x = A e 



2i 



= Ae“" /2m sin(u;'t + <(>) 

This procedure is equivalent to imposing the boundary condition x — A sin cp at t — 0 
upon the solution for x. The displacement therefore varies sinusoidally with time as in the 
case of simple harmonic motion, but now has a new frequency 

1/2 



U> = 



m 4m 2 
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rt 




Figure 2.4 Damped oscillatory motion where s/m > r 2 /^m 2 . The amplitude decays with e rt / 2m , 
and the reduced angular frequency is given by w' 2 = s/m — r 2 /4m 2 



and its amplitude A is modified by the exponential term e“ r, ' /2m , a term which decays with 
time. 

If x = 0 at t = 0 then 0 = 0; Figure 2.4 shows the behaviour of x with time, its 
oscillations gradually decaying with the envelope of maximum amplitudes following the 
dotted curve e rt / 2m , The constant A is obviously the value to which the amplitude would 
have risen at the first maximum if no damping were present. 

The presence of the force term rx in the equation of motion therefore introduces a loss of 
energy which causes the amplitude of oscillation to decay with time as e~ r, / 2m . 

(Problem 2.3) 

Methods of Describing the Damping of an Oscillator 

Earlier in this chapter we saw that the energy of an oscillator is given by 

E = \ma 2 ui 2 = \sa 2 

that is, proportional to the square of its amplitude. 

We have just seen that in the presence of a damping force rx the amplitude decays with 
time as 



—rt/2m 

so that the energy decay will be proportional to 

2 

that is, c The larger the value of the damping force r the more rapid the decay of the 
amplitude and energy. Thus we can use the exponential factor to express the rates at which 
the amplitude and energy are reduced. 
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Logarithmic Decrement 

This measures the rate at which the amplitude dies away. Suppose in the expression 

x = A e~ rt l 2m sin (u't + Q) 



we choose 



4> = 7t/2 



and we write 



X = AqQ r ' /,2m COS uj't 

with x = To at t = 0. Its behaviour will follow the curve in Figure 2.5. 

If the period of oscillation is r' where u/ = 27 t/t', then one period later the amplitude is 
given by 

Ai = A 0 e^ r / 2m ^' 

so that 

^0 = e rr'/2m = 6 

Al 



e 2m 




Figure 2.5 The logarithmic ratio of any two amplitudes one period apart is the logarithmic 
decrement, defined as S = log e (4„/4 n+ i) = rr'/2m 
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where 



6 = 




Ap 

A~i 



is called the logarithmic decrement. (Note that this use of 6 differs from that in Figure 1.11). 
The logarithmic decrement 6 is the logarithm of the ratio of two amplitudes of oscillation 
which are separated by one period, the larger amplitude being the numerator since e s > 1. 
Similarly 

_ e r(2r')/2m _ 26 

a 2 

and 

— = e n6 



Experimentally, the value of 6 is best found by comparing amplitudes of oscillations 
which are separated by n periods. The graph of 



log e 



A 

A 



o 

n 



versus n for different values of n has a slope 6. 



Relaxation Time or Modulus of Decay 

Another way of expressing the damping effect is by means of the time taken for the 
amplitude to decay to 

e- 1 =0.368 

of its original value Ap. This time is called the relaxation time or modulus of decay and the 
amplitude 

A t — Ape~ rt ! 2m = A 0 e _1 

at a time t = 2m/ r. 

Measuring the natural decay in terms of the fraction e~* of the original value is a very 
common procedure in physics. The time for a natural decay process to reach zero is, of 
course, theoretically infinite. 



(Problem 2.4) 



The Quality Factor or Q-value of a Damped Simple Harmonic Oscillator 

This measures the rate at which the energy decays. Since the decay of the amplitude is 
represented by 



A=ApQ- r, ! 2m 
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the decay of energy is proportional to 



A 2 



A 2 e (- rt /2m) 2 



and may be written 



E = £ 0 e (_r/m)r 

where Eq is the energy value at t = 0. 

The time for the energy E to decay to £o e 1 is given by t = m/r s during which time the 
oscillator will have vibrated through t o' m/r rad. 

We define the quality factor 



as the number of radians through which the damped system oscillates as its energy 
decays to 



E = Eq e~' 



If r is small, then Q is very large and 



5 

m 



> 



r 



2 



4m 2 



so that 



5 \ 1/2 



U) « UJ 0 — — 

. m 



Thus, we write, to a very close approximation, 

q _ m 
r 

which is a constant of the damped system. 

Since r/m now equals cjq/Q we can write 

E = Eq e ( ‘ r/m)r = Eq 

The fact that Q is a constant (= cuq m/r) implies that the ratio 



energy stored in system 
energy lost per cycle 
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is also a constant, for 



Q u>o m vq m 

It: lirr r 



is the number of cycles (or complete oscillations) through which the system moves in 
decaying to 

E^Eqq - 1 

and if 



E = E 0 e { - r/m)t 



the energy lost per cycle is 



, AE A —r 1 
—A E = —— At = — E — 
At m v' 



where At = 1 /v' 
Thus, the ratio 



t', the period of oscillation. 



energy stored in system E v'm 
energy lost per cycle — AE r 

2n 



vq m 
r 



In the next chapter we shall meet the same quality factor Q in two other roles, the first as 
a measure of the power absorption bandwidth of a damped oscillator driven near its 
resonant frequency and again as the factor by which the displacement of the oscillator 
is amplified at resonance. 



Example on the Q-value of a Damped Simple Harmonic Oscillator 

An electron in an atom which is freely radiating power behaves as a damped simple 
harmonic oscillator. 

If the radiated power is given by P = 1 2ttcoc 3 W at a wavelength of 0.6 pm 

(6000 A), show that the <2- value of the atom is about 10 8 and that its free radiation lifetime 
is about 10 8 s (the time for its energy to decay to e 1 of its original value). 

q= 1.6 x 10~ 19 C 
1 /47T£o = 9 x 10 9 mF -1 
m e — 9 x 10~ 31 kg 
c = 3 x 10 8 ms -1 

vo = maximum amplitude of oscillation 

The radiated power P is — v AE, where — AE is the energy loss per cycle, and the energy of 
the oscillator is given by E = \m e oj 2 x q. 
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Thus, Q = 2tt£/ — A E = in: m e cj 2 x\/P, and inserting the values above with ui — 2 tuj — 
2ttc/X, where the wavelength A is given, yields a Q value of ~ 5 x 10 7 . 

The relation Q = cot gives t, the radiation lifetime, a value of ~ 10“ 8 s. 

Energy Dissipation 

We have seen that the presence of the resistive force reduces the amplitude of oscillation 
with time as energy is dissipated. 

The total energy remains the sum of the kinetic and potential energies 

E = \mx 2 + \sx 2 

Now, however, dE/dt is not zero but negative because energy is lost, so that 

d E d . . , , .... , 

— = — Hmx + ±sx~) = x(mx + sx ) 
df d/' 2 2 ’ v 7 

= x(—rx) for mx + rx + sx = 0 

i.e. dE/dt = —rx 2 , which is the rate of doing work against the frictional force (dimensions 
of force x velocity = force x distance/time). 

(Problems 2.5, 2.6) 

Damped SHM in an Electrical Circuit 

The force equation in the mechanical oscillator is replaced by the voltage equation in the 
electrical circuit of inductance, resistance and capacitance (Figure 2.6). 




L — 
df 



/Rh 




Figure 2.6 Electrical 
harmonic oscillations. 

asL^ + RI + ^0 
at C 



circuit of inductance, capacitance and resistance capable of damped simple 
The sum of the voltages around the circuit is given from Kirchhoff's law 
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We have, therefore, 



d I q 

L— + RI + j; = 0 
dr C 



Lq + Rq + — — 0 



and by comparison with the solutions for x in the mechanical case we know immediately 
that the charge 



q = q oe 



-Rt/2L± {R 2 /4L 2 -1/LC) 1,2 1 



which, for l/LC > R 2 /4L 2 , gives oscillatory behaviour at a frequency 

2 _ 1 R 2 
~LC~4L 2 

From the exponential decay term we see that R/L has the dimensions of inverse time 7' 1 
or u>, so that uiL has the dimensions of R\ that is, loL is measured in ohms. 

Similarly, since u: 2 = 1 /LC,uiL = 1 /loC, so that 1 /ujC is also measured in ohms. We 
shall use these results in the next chapter. 



(Problems 2.7, 2.8, 2.9) 



Problem 2.1 

The heavily damped simple harmonic system of Figure 2.2 is displaced a distance F from its 
equilibrium position and released from rest. Show that in the expression for the displacement 




x = e P \F cosh qt + G sinh qt) 


where 


r ( r 2 s\ '^ 2 

p = - and 


that the ratio 


G r 

F ( r 2 _ 4, HiS ) '/ 2 


Problem 2.2 

Verify that the solution 


x= (A + Bt)e -rt / 2m 
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satisfies the equation 



nix + rx + sx = 0 



when 



r 2 /Am 2 = s/m 



Problem 2.3 

The solution for damped simple harmonic motion is given by 

x = e _rt / 2 m (C 1 e ia/ ' + C 2 e _i "'') 

If x = A cos (p at t = 0, find the values of C i and C 2 to show that x sa —uj'A sin </> at t = 0 only if r/m 
is very small or </> 7t/2. 

Problem 2.4 

A capacitance C with a charge q 0 at t = 0 discharges through a resistance I?. Use the voltage 
equation q/C + IR = 0 to show that the relaxation time of this process is RC s; that is. 



(Note that t/RC is non-dimensional.) 

Problem 2.5 

The frequency of a damped simple harmonic oscillator is given by 

2 2 
,2 S r 2 r 

CO — - ~ = COq ~ — r 

m 4 m z 4 m A 

(a) If Wq — u>' 2 = 10 _6 tUo show that Q = 500 and that the logarithmic decrement 6 = 7 t/ 500 . 

(b) If wo = 10 6 and m = 10 -10 Kg show that the stiffness of the system is lOONrn -1 , and that the 
resistive constant r is 2 x 10 “ 7 N • sm -1 . 

(c) If the maximum displacement at t = 0 is 10~ 2 m, show that the energy of the system is 5 x 10~ 3 
I and the decay to e _1 of this value takes 0.5 ms. 

(d) Show that the energy loss in the first cycle is 27 r x 10 “ 5 J. 

Problem 2.6 

Show that the fractional change in the resonant frequency Wo(wq = s/m) of a damped simple 
harmonic mechanical oscillator is ss (8£> 2 ) -1 where Q is the quality factor. 

Problem 2.7 

Show that the quality factor of an electrical LCR series circuit is Q = loqL/R where Wq = 1/LC 

Problem 2.8 

A plasma consists of an ionized gas of ions and electrons of equal number densities («,■ = n e = n) 
having charges of opposite sign ±e, and masses nij and m e , respectively, where m, > m e . Relative 
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displacement between the two species sets up a restoring 



+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 



E 



I 



electric field which returns the electrons to equilibrium, the ions being considered stationary. In the 
diagram, a plasma slab of thickness / has all its electrons displaced a distance x to give a restoring 
electric field E = nex/e o, where £q is constant. Show that the restoring force per unit area on the 
electrons is xn 2 e 2 l/e o and that they oscillate simple harmonically with angular frequency lo 2 = 
ne 2 /m e e o- This frequency is called the electron plasma frequency, and only those radio waves of 
frequency u> > cu e will propagate in such an ionized medium. Hence the reflection of such waves 
from the ionosphere. 

Problem 2.9 

A simple pendulum consists of a mass m at the end of a string of length / and performs small 
oscillations. The length is very slowly shortened whilst the pendulum oscillates many times at a 
constant amplitude 18 where 8 is very small. Show that if the length is changed by —A/ the work 
done is —mgAI (owing to the elevation of the position of equilibrium) together with an increase in 
the pendulum energy 

A E = I mg — — mid 2 J A / 

where 8 2 is the average value of 8 2 during the shortening. If 8 = 8q cos uit, show that the energy of 
the pendulum at any instant may be written 



and hence show that 



E = 



ml 2 uj 2 dl 

2 



mgl8p 

2 



A E lAl Av 

~E~ ~ ~2T~ — 

that is, E/v, the ratio of the energy of the pendulum to its frequency of oscillation remains constant 
during the slowly changing process. (This constant ratio under slowly varying conditions is 
important in quantum theory where the constant is written as a multiple of Planck’s constant, h.) 




52 



Damped Simple Harmonic Motion 



Summary of Important Results 

Damped Simple Harmonic Motion 

Equation of motion nix + rx + sx = 0 
Oscillations when 

? 

s r 
m 4 m 2 

Displacement x = A e~ r, / 2m cos (oj't + 4>) where 

.J2 _ s 

m 4 m 2 



Amplitude Decay 

Logarithmic decrement 6 — the logarithm of the ratio of two successive amplitudes one 
period r' apart 



x A n rr 

8 = log e 

A„ + i 2m 



Relaxation Time 

Time for amplitude to decay to A = Aq e n,lm — Aq e _1 ; that is, t = 2m j r 



Energy Decay 

Quality factor Q is the number of radians during which energy decreases to E = £ (l e 

ujQin ^ energy stored in system 
r energy lost per cycle 

E — Eq Q~ rt l m — e 0 e _1 when Q = u>ot 



In damped SHM 



d E 
df 



(nix + sx)x = —rx 2 (work rate of resistive force) 



For equivalent expressions in electrical oscillators replace m by L, r by R and .v by 1/C. 
Force equations become voltage equations. 
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The Forced Oscillator 

The Operation of i upon a Vector 

We have already seen that a harmonic oscillation can be conveniently represented by the 
form e lw '. In addition to its mathematical convenience i can also be used as a vector 
operator of physical significance. We say that when i precedes or operates on a vector the 
direction of that vector is turned through a positive angle (anticlockwise) of 7r/2, i.e. i 
acting as an operator advances the phase of a vector by 90°. The operator — i rotates the 
vector clockwise by tt/ 2 and retards its phase by 90°. The mathematics of i as an operator 
differs in no way from its use as a/— T and from now on it will play both roles. 

The vector r = a + ib is shown in Figure 3.1, where the direction of b is perpendicular to 
that of a because it is preceded by i. The magnitude or modulus or r is written 

r = |r| = ( a 2 + b 2 ) ^ 2 



and 



r 2 = ( a 2 + b 2 ) = (a + ib)(a - ib) = rr*, 

where (a — ib) = r* is defined as the complex conjugate of (a + ib); that is, the sign of i is 
changed. 

The vector r* = a — ib is also shown in Figure 3.1. 

The vector r can be written as a product of its magnitude r (scalar quantity) and its phase 
or direction in the form (Figure 3.1) 

r = re' 5 ’ = r(cos <fi + i sin (f>) 

— a + ib 



showing that a = r cos (f> and b — r sin q b. 
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Figure 3.1 Vector representation using i operator and exponential index. Star superscript indicates 
complex conjugate where — i replaces i 



It follows that 



and 



giving tan</> = b/a. 
Similarly 



COS (f) 



a 

r 



a 

(a 2 + b 2 ) 




b 

{a 2 + b 2 ) l/2 



r * = re ^ = r( cos cf) — i sin (j>) 

a —b —b 

cos (j) = -, sin<p — — and tan<j> = — (Figure 3.1) 
r r a 

The reader should confirm that the operator i rotates a vector by 7 t/ 2 in the positive 
direction (as stated in the first paragraph of p. 53) by taking cf> = it 12 in the expression 

r = re'^ = r(cos7r/2 + isin7r/2) 

Note that 4> = — 7r/2 in r = re l7r,/2 rotates the vector in the negative direction. 

Vector form of Ohm's Law 

Ohm’s Law is first met as the scalar relation V = IR , where V is the voltage across the 
resistance R and I is the current through it. Its scalar form states that the voltage and current 
are always in phase. Both will follow a sin (cot + <f>) or a cos (cot + <f>) curve, and the value 
of 0 will be the same for both voltage and current. 

However, the presence of either or both of the other two electrical components, 
inductance L and capacitance C, will introduce a phase difference between voltage and 
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d/ h 

+ df _ + IR _ , ,_C 



c 



/ = / 0 e imt 



Figure 3.2a An electrical forced oscillator. The voltaqe I /„ is applied to the series LCR circuit qivinq 
V a = Ldl/dt + IR+q/C 



current, and Ohm’s Law takes the vector form 

V = IZ e , 

where Z (> , called the impedance , replaces the resistance, and is the vector sum of the 
effective resistances of R , L, and C in the circuit. 

When an alternating voltage V a of frequency c o is applied across a resistance, inductance 
and condenser in series as in Figure 3.2a, the balance of voltages is given by 

V a = IR + Lj t + q/C 

and the current through the circuit is given by I = 1q e luJt . The voltage across the inductance 

V L = L — = L - I 0 e iwf = iuj LI o e iuJt = i wLI 

df df 

But ojL, as we saw at the end of the last chapter, has the dimensions of ohms, being the 
value of the effective resistance presented by an inductance L to a current of frequency uj. 
The product loLI with dimensions of ohms times current, i.e. volts, is preceded by i; this 
tells us that the phase of the voltage across the inductance is 90° ahead of that of the current 
through the circuit. 

Similarly, the voltage across the condenser is 



1 

C 



1 

c 



Idt = -I 0 
C 



1 

it oC 



' d t = T V- / 0 e “* 



i / 

ojC 



(since 1/i = — i). 

Again 1 /loC, measured in ohms, is the value of the effective resistance presented by the 
condenser to the current of frequency ui. Now, however, the voltage I/ojC across the 
condenser is preceded by — i and therefore lags the current by 90°. The voltage and current 
across the resistance are in phase and Figure 3.2b shows that the vector form of Ohm's 
Law may be written V= YL e = l[R + \{u>L— \/ loC)\, where the impedance Z,, = 
R + i(uiL — 1 /luC). The quantities ivL and 1/c oC are called reactances because they 
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Tl 




Figure 3.2b Vector addition of resistance and reactances to give the electrical impedance Z e = 

R + i (ljL — 1 /loC) 



introduce a phase relationship as well as an effective resistance, and the bracket 
( ujL — I /loC) is often written X e , the reactive component of Z e . 

The magnitude, in ohms, i.e. the value of the impedance, is 



Z, 



R 2 + 




1/2 



and the vector Z e may be represented by its magnitude and phase as 

Z e = Z e e l<t> — Z e ( cos 4> + i sin 4>) 



so that 

, R • , X e 

cos a — — , sin 0 — — 

* Z e ’ v Z e 

and 

tanc j) = X e /R, 

where (f> is the phase difference between the total voltage across the circuit and the current 
through it. 

The value of 4> can be positive or negative depending on the relative value of loL and 
1/tuC: when loL > I /ujC. & is positive, but the frequency dependence of the components 
show that (j) can change both sign and size. 

The magnitude of Z e is also frequency dependent and has its minimum value Z e = R 
when ojL = 1/cuC. 

In the vector form of Ohm’s Law, V = IZ^. If V = V()S lu)t and Z,, = Z ( , e'L then we have 

l Vo^‘ Vo c H. t -*) 

Z e e 1 ^ Z e 

giving a current of amplitude Vo /Z e which lags the voltage by a phase angle (j). 



The Impedance of a Mechanical Circuit 

Exactly similar arguments hold when we consider not an electrical oscillator but a 
mechanical circuit having mass, stiffness and resistance. 
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The mechanical impedance is defined as the force required to produce unit velocity in 
the oscillator, i.e. Z m = F/v or F = vZ,„. 

Immediately, we can write the mechanical impedance as 

Z„, = r + i (utm ^ = r + iX,„ 



where 



and 



Z , tl — Z, t , e 



tan f = X m /r 

f being the phase difference between the force and the velocity. The magnitude of Z m = 

[; r 2 + (com — s/u>)~] l '~. 

Mass, like inductance, produces a positive reactance, and the stiffness behaves in exactly 
the same way as the capacitance. 



Behaviour of a Forced Oscillator 

We are now in a position to discuss the physical behaviour of a mechanical oscillator of 
mass m, stiffness .v and resistance r being driven by an alternating force F o cos cut, where F o 
is the amplitude of the force (Figure 3.3). The equivalent electrical oscillator would be an 
alternating voltage To cos tut applied to the circuit of inductance L, capacitance C and 
resistance R in Figure 3.2a. 

The mechanical equation of motion, i.e. the dynamic balance of forces, is given by 

nix + rx + sx = F o cos tut 
and the voltage equation in the electrical case is 

Lq + Rq + q/C = Vq cos tut 

We shall analyse the behaviour of the mechanical system but the analysis fits the electrical 
oscillator equally well. 



inmnnr 






m 



Figure 3.3 Mechanical forced osciLlator with force F 0 cos tot applied to damped mechanical circuit 
of Figure 2.1 
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The complete solution for x in the equation of motion consists of two terms: 

(1) a ‘transient’ term which dies away with time and is, in fact, the solution to the equation 
rnx + rx + sx = 0 discussed in Chapter 2. This contributes the term 

^ ^ - rt /2m ^ \isfni - r 2 /4m 1 ) 1/2 / 

which decays with The second term 

(2) is called the ‘steady state' term, and describes the behaviour of the oscillator after the 
transient term has died away. 

Both terms contribute to the solution initially, but for the moment we shall concentrate 
on the ‘steady state’ term which describes the ultimate behaviour of the oscillator. 

To do this we shall rewrite the force equation in vector form and represent cos cut by e lut 
as follows: 



mx + rx + sx — F oe“ (3-1) 

Solving for the vector x will give both its magnitude and phase with respect to the driving 
force Fq e ,UJl . Initially, let us try the solution x = A e luJf , where A may be complex, so that it 
may have components in and out of phase with the driving force. 

The velocity 



x = ituAe 1 "' = icux 



so that 



”•22 2 
X = 1 OJ X = —UJ X 



and equation (3.1) becomes 

(-A cj 2 m + icoAr + As) e iuJI = F 0 e iuJt 
which is true for all t when 

A = 7 — 

iuir + (s — U! 2 m) 

or, after multiplying numerator and denominator by — i 

-iF 0 — iF 0 



A = 



Hence 



co[r + i(t om — s/u>)\ ufL„ 



■ wt — iF 0 e iuJt —iFo e‘“ ; 



x = Ae“ = 



UjZ 



‘-iF 0 e i ^~^ 

utZ m 



u>Z m e ^ 
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where 



Z m — [r 2 + (tom — s/tv ) 2 } '' 

This vector form of the steady state behaviour of x gives three pieces of information and 
completely defines the magnitude of the displacement x and its phase with respect to the 
driving force after the transient term dies away. It tells us 

1 . That the phase difference f exists between x and the force because of the reactive part 
(turn — s/co) of the mechanical impedance. 

2. That an extra difference is introduced by the factor — i and even if <1) were zero the 
displacement x would lag the force Fp cos cut by 90°. 

3. That the maximum amplitude of the displacement x is Fp/loZ,,,. We see that this is 
dimensionally correct because the velocity x/t has dimensions Fp/Z,„. 

Having used F o e 1 ^' to represent its real part F o cos tut, we now take the real part of the 
solution 



— iFoe’^ ^ 
x = 

t oZi m 

to obtain the actual value of x. (If the force had been F o sin c at, we would now take that part 
of x preceded by i.) 

Now 



x 



_ £ i (u>t-<j>) 



LoZ m 

iF 0 

ujZj hi 

iF 0 
cuZ in 



[cos (tut — <j>) + i sin (tut — 



cos (tut — 4>) 



tuZ„ 



sin (tut — f) 



The value of x resulting from Fp cos tut is therefore 



x = —— sin (tut — 4>) 

(jjZyn 

[the value of x resulting from Fp sin tut would be — Fp cos (tut — f) /uiZ„^ . 

Note that both of these solutions satisfy the requirement that the total phase difference 
between displacement and force is f plus the — 7t/2 term introduced by the — i factor. When 
0 = 0 the displacement x = Fp sin tut/tuZ m lags the force Fp cos tut by exactly 90°. 




60 



The Forced Oscillator 



To find the velocity of the forced oscillation in the steady state we write 

v = X = (i u) e i(wf -^ 



F o 

= — e 
Z 



ioZ m 

i (uJt—<t>) 



We see immediately that 



1 . There is no preceding i factor so that the velocity v and the force differ in phase only 
by <j), and when f — 0 the velocity and force are in phase. 

2. The amplitude of the velocity is Fq/Z,„, which we expect from the definition of 
mechanical impedance Z„, = F/v. 



Again we take the real part of the vector expression for the velocity, which will 
correspond to the real part of the force F oe lwf . This is 

v = — - cos (c ot — 4>) 

Thus, the velocity is always exactly 90° ahead of the displacement in phase and differs 
from the force only by a phase angle f, where 

tom — s /to X m 

r r 



tan 0 = 



so that a force Fq cos t ot gives a displacement 

F 0 



x = 



ojZ„ 



sin (uit — i j>) 



and a velocity 



v = — — cos (c ot — <f>) 
Z I,, 



(Problems 3.1, 3.2, 3.3, 3.4) 



Behaviour of Velocity v in Magnitude and Phase versus Driving 
Force Frequency co 

The velocity amplitude is 



F o Fo 

Z m [r 2 + (urn — s/u ) 2 ] 

so that the magnitude of the velocity will vary with the frequency io because Z,„ is 
frequency dependent. 
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co 0 = (s/m) 2 



Figure 3.4 Velocity of forced oscillator versus driving frequency lo. Maximum velocity v max = F 0 /r 
at io z 0 = s/m 



At low frequencies, the term — s/lo is the largest term in Z,„ and the impedance is said to 
be stiffness controlled. At high frequencies com is the dominant term and the impedance is 
mass controlled. At a frequency loq where loqiu = s/lo o, the impedance has its minimum 
value Z,„ = r and is a real quantity with zero reactance. 

The velocity Fq/Z,,, then has its maximum value v — Fo/r. and loq is said to be the 
frequency of velocity resonance. Note that tan 0 = 0 at loq , the velocity and force being in 
phase. 

The variation of the magnitude of the velocity with driving frequency, lo, is shown in 
Figure 3.4, the height and sharpness of the peak at resonance depending on r, which is the 
only effective term of Z,„ at too- 

The expression 



F o 



cos (got — 0) 



where 



, loiu — s/lo 

tan 0 = — 

r 

shows that for positive 0; that is, com > s/lo, the velocity v will lag the force because —i> 
appears in the argument of the cosine. When the driving force frequency lo is very high and 
lo — » oo, then 0 — » 90° and the velocity lags the force by that amount. 

When ioni < s/lo, 0 is negative, the velocity is ahead of the force in phase, and at low 
driving frequencies as oo — *• 0 the term s/lo — > oo and 0 — » —90°. 

Thus, at low frequencies the velocity leads the force ( 0 negative) and at high frequencies 
the velocity lags the force ( 0 positive). 

At the frequency lo o, however, loqiu = s/lo o and 0 = 0, so that velocity and force are in 
phase. Figure 3.5 shows the variation of 0 with lo for the velocity, the actual shape of the 
curves depending upon the value of r. 
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Figure 3.5 Variation of phase angle <fi versus driving frequency, where <j> is the phase angle between 
the velocity of the forced oscillator and the driving force. <j> = 0 at velocity resonance. Each curve 
represents a fixed resistance value 



(Problem 3.5) 



Behaviour of Displacement versus Driving Force Frequency a> 



The phase of the displacement 



F o 
cuZ m 



sin (cut — (p) 



is at all times exactly 90° behind that of the velocity. Whilst the graph of </> versus cu 
remains the same, the total phase difference between the displacement and the force 
involves the extra 90° retardation introduced by the — i operator. Thus, at very low 
frequencies, where </> = — 7r/2 rad and the velocity leads the force, the displacement and 
the force are in phase as we should expect. At high frequencies the displacement lags the 
force by 7r rad and is exactly out of phase, so that the curve showing the phase angle 
between the displacement and the force is equivalent to the versus cu curve, displaced by 
an amount equal to 7t/2 rad. This is shown in Figure 3.6. 

The amplitude of the displacement x = Fo/cuZ m , and at low frequencies Z m — 
[r 2 + ( cum — s/uu) 2 ] 1/2 — > s/cu, so that x « Fq/(cus/cu) — Fq/s. 




Figure 3.6 Variation of total phase angle between displacement and driving force versus driving 
frequency cu. The total phase angle is —cj) — 7r/2 rad 




Behaviour of Displacement versus Driving Force Frequency to 
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At high frequencies Z m — > urn, so that .r « Fo/(iv 2 m), which tends to zero as t o becomes 
very large. At very high frequencies, therefore, the displacement amplitude is almost zero 
because of the mass-controlled or inertial effect. 

The velocity resonance occurs at uJq = s/m, where the denominator Z m of the velocity 
amplitude is a minimum, but the displacement resonance will occur, since x = (Fq/ivZ,,,) 
sin (cot — 0 ), when the denominator coZ m is a minimum. This takes place when 

(c vZ m ) = — cu[r 2 + (tom — s/tv) 2 ] 1//2 = 0 
dcu dtu 

i.e. when 

2uir 2 + 4ivm(cv 2 m — s) — 0 
or 

2t v[r 2 + 2m(u> 2 m — s)] = 0 

so that either 

iv = 0 



or 

2 2 

cj 2 = S f = J 2 V 

m 2m 2 0 2m 2 

Thus the displacement resonance occurs at a frequency slightly less than cuo, the 
frequency of velocity resonance. For a small damping constant r or a large mass m these 
two resonances, for all practical purposes, occur at the frequency ivq- 
Denoting the displacement resonance frequency by 

/ s r \ 
r ym 2m 2 ) 

we can write the maximum displacement as 

F 0 

-^max ry 

UJ r^m 



The value of t o r Z m at uj r is easily shown to be equal to c Jr where 



m Am 2 0 4m 2 

The value of x at displacement resonance is therefore given by 

_F± 

X max , 
iu'r 



uj = 





1/2 



where 
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Figure 3.7 Variation of the displacement of a forced oscillator versus driving force frequency uj for 
various values of r 



Since x max = Fo/uj'r at resonance, the amplitude at resonance is kept low by increasing 
r and the variation of x with u> for different values of r is shown in Figure 3.7. A negligible 
value of r produces a large amplification at resonance: this is the basis of high selectivity in 
a tuned radio circuit (see the section in this chapter on Q as an amplification factor). 
Keeping the resonance amplitude low is the principle of vibration insulation. 

(Problems 3.6, 3.7) 

Problem on Vibration Insulation 

A typical vibration insulator is shown in Figure 3.8. A heavy base is supported on a 
vibrating floor by a spring system of stiffness s and viscous damper r. The insulator will 
generally operate at the mass controlled end of the frequency spectrum and the resonant 
frequency is designed to be lower than the range of frequencies likely to be met. Suppose 
the vertical vibration of the floor is given by x = A cos uit about its equilibrium position and 
y is the corresponding vertical displacement of the base about its rest position. The function 
of the insulator is to keep the ratio y/A to a minimum. 

The equation of motion is given by 



my = —r(y — x) — s(y — x) 
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Heavy base 




Equilibrium 
rest position 
of base 



x = A cos (of 

i 



Fixed reference level 



Figure 3.8 Vibration insulator. A heavy base supported by a spring and viscous damper system on a 
vibrating floor 



which, if y — x — X, becomes 



mX + rX + sX = —m'x = mAco 2 cos Lot 

= F 0 COS LOt, 



where 



F o = mAio 2 



Use the steady state solution of X to show that 



y = 



F o 

ijjZ pji 



sin ( oot — (/>)+ A cos cot 



and (noting that y is the superposition of two harmonic components with a constant phase 
difference) show that 



Vmax _ (r 2 +S 2 /u ) 2 ) 1/2 

A Z m 



where 

z l = - s /u ) 2 

Note that 



.V max 



> 1 



if LO 2 < 



2s 



A 



m 
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so that s/m should be as low as possible to give protection against a given frequency u>. 

(a) Show that 

}' max i r 2 ^ 

— — =1 for l o = — 

A m 

(b) Show that 

y max , „ 2 . 

<1 for to > — 

A m 

(c) Show that if lo 2 = s/m , then y mm /A > 1 but that the damping term r is helpful in 
keeping the motion of the base to a reasonably low level. 

(d) Show that if u> 2 > 2s /in, then y max /A < 1 but damping is detrimental. 

Significance of the Two Components of the Displacement Curve 

Any single curve of Figure 3.7 is the superposition of the two component curves (a) and (b) 
in Figure 3.9, for the displacement x may be rewritten 




Figure 3.9 A typical, curve of Figure 3.7 resolved into its 'anti-phase' component (curve (a)) and its 
'90° out of phase' component (curve (b)). Curve (b) represents the resistive fraction of the 
impedance and curve (a) the reactive fraction. Curve (b) corresponds to absorption and curve (a) to 
anomalous dispersion of an electromagnetic wave in a medium having an atomic or molecuLar resonant 
frequency equal to the frequency of the wave 
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or, since 



COS (/) 




and sin f 



X 

z 



m 

m 



as 



x = 



F o 



u;Z rfi 2/ yr 



^ . Fo 

sin cut cos cut 



(jJZj iij 



The coscctf component (with a negative sign) is exactly anti-phase with respect to the 
driving force F o cos c ot. Its amplitude, plotted as curve (a) may be expressed as 

F 0 X m _ Fom(u>Q — to 2 ) , . 

m 2 (u>Q — uj 2 ) 2 + LU 2 r 2 

where lOq — s/m and cop is the frequency of velocity resonance. 

The sinwf component lags the driving force F {] cos cot by 90°. Its amplitude plotted as 
curve (b) becomes 



F o r Fp utr 

wr 2 + X^ m 2 (ujQ — a; 2 )' + co 2 r 2 



We see immediately that at u > o curve (a) is zero and curve (b) is near its maximum but they 
combine to give a maximum at uj where 



UJ = UJr. ~ 



2m 2 



the resonant frequency for amplitude displacement. 

These curves are particularly familiar in the study of optical dispersion where the forced 
oscillator is an electron in an atom and the driving force is the oscillating field vector of an 
electromagnetic wave of frequency uj. When uj is the resonant frequency of the electron in 
the atom, the atom absorbs a large amount of energy from the electromagnetic wave and 
curve fb) is the shape of the characteristic absorption curve. Note that curve (b) represents 
the dissipating or absorbing fraction of the impedance 



r 

and that part of the displacement which lags the driving force by 90°. The velocity 
associated with this component will therefore be in phase with the driving force and it is 
this part of the velocity which appears in the energy loss term rk 2 due to the resistance of 
the oscillator and which gives rise to absorption. 
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On the other hand, curve (a) represents the reactive or energy storing fraction of the 
impedance 

Xm 

C r 2 +Xf u ) 1/ 2 

and the reactive components in a medium determine the velocity of the waves in 
the medium which in turn governs the refractive index n. In fact, curve (a) is a graph of the 
value of n 2 in a region of anomalous dispersion where the ui axis represents the value 
n — 1. These regions occur at every resonant frequency of the constituent atoms of 
the medium. We shall return to this topic later in the book. 

(Problems 3.8, 3.9, 3.10) 

Power Supplied to Oscillator by the Driving Force 

In order to maintain the steady state oscillations of the system the driving force must 
replace the energy lost in each cycle because of the presence of the resistance. We shall 
now derive the most important result that: 

‘in the steady state the amplitude and phase of a driven oscillator adjust themselves so 
that the average power supplied by the driving force just equals that being dissipated by the 
frictional force’. 

The instantaneous power P supplied is equal to the product of the instantaneous driving 
force and the instantaneous velocity, that is, 

F 0 

P = F o cos cot — cos (cut — f) 

'Zjm 

F 2 

= — cos cot cos (cot — <f) 



The average power 



P av = 



' P = 

• • 1 av 



total work per oscillation 
oscillation period 

T P At 

where T — oscillation period 

o T 



F 2 
r o 

7 T 



F 2 

r o 

Z m T 



f T 

cos cot cos (cot — <f) dr 
. o 

T 

[cos 2 cot cos cj) + cos cot sin cot sin f) dr 
. o 



= IL 

2Z 



COS (j) 
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because 



cos ujt x sin ujtdt — 0 



and 



2 . 1 

cos" Lotat = - 

o 2 



The power supplied by the driving force is not stored in the system, but dissipated as 
work expended in moving the system against the frictional force rx. 

The rate of working (instantaneous power) by the frictional force is 



F 1 

Z?. 



(rx)x = rx 2 = r cos 2 (cut — <p) 



and the average value of this over one period of oscillation 



1 rF q 1 F q 

^ = cos cb 

2 Zi 2 Z,„ r 



for — = cos < 
7 

I-j 117 



This proves the initial statement that the power supplied equals the power dissipated. 

In an electrical circuit the power is given by VI cos <fi, where hand I are the instantaneous 
r.m.s. values of voltage and current and cos< j> is known as the power factor. 



VI cos 



V 2 

T e 



COS (f> 




COS (j) 



since 



V = 



Vo 
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(Problem 3.11) 

Variation of P av with co. Absorption Resonance Curve 

Returning to the mechanical case, we see that the average power supplied 

^av = {Fq/2Z,„) COS f 

is a maximum when cos 0=1; that is, when 0=0 and u>m — s/io = 0 or uJq = s/m. The 
force and the velocity are then in phase and Z m has its minimum value of r. Thus 



Pavlmaximum) = F//2r 
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A graph of P av versus uj, the frequency of the driving force, is shown in Figure 3.10. Like 
the curve of displacement versus uj, this graph measures the response of the oscillator; the 
sharpness of its peak at resonance is also determined by the value of the damping constant 
r, which is the only term remaining in Z,„ at the resonance frequency c oq. The peak occurs 
at the frequency of velocity resonance when the power absorbed by the system from the 
driving force is a maximum; this curve is known as the absorption curve of the oscillator 
(it is similar to curve (b) of Figure 3.9). 



The Q-Value in Terms of the Resonance Absorption Bandwidth 

In the last chapter we discussed the quality factor of an oscillator system in terms of energy 
decay. We may derive the same parameter in terms of the curve of Figure 3.10, where the 
sharpness of the resonance is precisely defined by the ratio 



U) 2 — CUi 



where u >2 and uj \ are those frequencies at which the power supplied 

P av = \P m (maximum) 

The frequency difference u >2 — wi is often called the bandwidth. 




Figure 3.10 Graph of average power versus ui supplied to an oscillator by the driving force. 
Bandwidth cu 2 — ui\ of resonance curve defines response in terms of the quality factor, Q = 

ujo/{u>i - u>\), where = s/m 
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Now 

Pay = rFl/lZ^ = \Pay (maximum) = \F\/2r 



when 



Z 



2 

m 



2 r 2 



that is, when 



r 2 + X 2 n = 2 r 2 or X m = ojm — s/co = ±r. 



If u >2 > wi, then 



UJ 2 m — s /^2 — +r 



and 



LOitn — s/u> i — —r 

Eliminating .v between these equations gives 



u >2 — = r/m 



so that 



Q = oj 0 m/r 



Note that ui\ = u>o — r /2m and L 02 = 0 + r/2m are the two significant frequencies in 

Figure 3.9. The quality factor of an electrical circuit is given by 



Q 



LOqL 

Tr - ’ 



where 



^ 2 o = m~ l 

Note that for high values of Q, where the damping constant r is small, the frequency lu' 
used in the last chapter to define Q = ui'm/r moves very close to the frequency ojq, and the 
two definitions of Q become equivalent to each other and to the third definition we meet in 
the next section. 



The Q-Value as an Amplification Factor 

We have seen that the value of the displacement at resonance is given by 



A 



max 



U '2 

— — where lj 
L j'r 



s 



r 



2 



m 4m 2 
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At low frequencies (u> — » 0) the displacement has a value Aq = Fq/s, so that 



Hence: 



for large 0. 




Fq s 2 m 2 u>Q 

uj' 2 r 2 Fq r 2 [ct>Q — r 2 /4m 2 ] 

u>lm 2 _ Q 2 
r 2 [ 1 - l/4g 2 ] 1/2 ] _ [1- 1/4 Q 2 } 



A m ax Q 

M ~ [l- l/4<2 2 ] 1/2 



Q 



l 

80 2 



0 




Figure 3.11 Curves of Figure 3.7 now given in terms of the quality factor Q of the system, where Q 
is amplification at resonance of low frequency response x = Fa/s 
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Thus, the displacement at low frequencies is amplified by a factor of Q at displacement 
resonance. 

Figure 3.7 is now shown as Figure 3.11 where the Q- values have been attached to each 
curve. In tuning radio circuits, the £7- value is used as a measure of selectivity, where 
the sharpness of response allows a signal to be obtained free from interference from signals 
at nearby frequencies. In conventional radio circuits at frequencies of one megacycle. 




(b) 



Figure 3.12 (a) The steady state osciLlation (heavy curve) is modulated by the transient which 

decays exponentially with time, (b) In the vector diagram of (b) OB is the constant Length steady 
state vector and BAi is the transient vector. Each vector rotates anti-clockwise with its own angular 
veLocity. At t = 0 the vectors OB and BA 0 are equal and opposite on the horizontal axis and their 
vector sum is zero. At subsequent times the total amplitude is the Length of OAi which changes as A 
traces a contracting spiral around B. The points Ai, A 2 , A 3 and A 4 indicate how the amplitude is 
modified in (a) 
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^-values are of the order of a few hundred; at higher radio frequencies resonant copper 
cavities have (7- values of about 30 000 and piezo-electric crystals can produce Q - values of 
500000. Optical absorption in crystals and nuclear magnetic resonances are often 
described in terms of (7- values. The Mossbauer effect in nuclear physics involves (7- values 
oflO 10 . 

The Effect of the Transient Term 

Throughout this chapter we have considered only the steady state behaviour without 
accounting for the transient term mentioned on p. 58. This term makes an initial 
contribution to the total displacement but decays with time as e n / 2m . Its effect is best 
displayed by considering the vector sum of the transient and steady state components. 

The steady state term may be represented by a vector of constant length rotating 
anticlockwise at the angular velocity uj of the driving force. The vector tip traces a circle. 
Upon this is superposed the transient term vector of diminishing length which rotates anti 
clockwise with angular velocity u / = ( s/m — r 2 /Am 2 ) 1 / 2 . Its tip traces a contracting spiral. 

The locus of the magnitude of the vector sum of these terms is the envelope of the 
varying amplitudes of the oscillator. This envelope modulates the steady state oscillations 
of frequency u> at a frequency which depends upon uj' and the relative phase between cot 
and uj't. 

Thus, in Figure 3.12(a) where the total oscillator displacement is zero at time t — 0 we 
have the steady state and transient vectors equal and opposite in Figure 3.12(b) but because 
uj uj' the relative phase between the vectors will change as the transient term decays. 
The vector tip of the transient term is shown as the dotted spiral and the total amplitude 
assumes the varying lengths OA[, OA 2 , OA 3 , OA 4 , etc. 

(Problems 3.12, 3.13, 3.14, 3.15, 3.16, 3.17, 3.18) 



Problem 3.1 

Show, if F oe“ represents Fo sinutf in the vector form of the equation of motion for the forced 
oscillator that 



x = 



F 0 



cos 

ujZ m 



(ut - 4>) 



and the velocity 




(ut - (j>) 



Problem 3.2 

The displacement of a forced oscillator is zero at time t = 0 and its rate of growth is governed by the 
rate of decay of the transient term. If this term decays to e ~ k of its original value in a time t show 
that, for small damping, the average rate of growth of the oscillations is given by xo/t = Fo/2kmujo 
where xq is the maximum steady state displacement, F 0 is the force amplitude and Wq = s/m. 
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Problem 3.3 

The equation nix + sx = F q sin tut describes the motion of an undamped simple harmonic oscillator 
driven by a force of frequency u>. Show, by solving the equation in vector form, that the steady state 
solution is given by 



F o sin w? 
w(w § — w 2 ) 



where Wa = — 
m 



Sketch the behaviour of the amplitude of x versus w and note that the change of sign as w passes 
through wo defines a phase change of n rad in the displacement. Now show that the general solution 
for the displacement is given by 



F 0 sinw? , „ . 

x = — xr + A cos wot + B sin wo? 

m(w o — w 2 ) 



where A and B are constant. 



Problem 3.4 

In problem 3.3, ifjc = ±= 0atr = 0 show that 

F () 1 



m (wq — w 2 ) 



sin iot sin wot 

w 0 



and, by writing w = wo + Aw where Aw/wo -C 1 and Aw? <C 1, show that near resonance, 

Fq . . , 

x = sin wo? — wo? cos wo?) 

2i«Wq 

Sketch this behaviour, noting that the second term increases with time, allowing the oscillations to 
grow (resonance between free and forced oscillations). Note that the condition Aw? <C 1 focuses 
attention on the transient. 



Problem 3.5 

What is the general expression for the acceleration v of a simple damped mechanical oscillator 
driven by a force F o cos wt? Derive an expression to give the frequency of maximum acceleration 
and show that if r = y/sm, then the acceleration amplitude at the frequency of velocity resonance 
equals the limit of the acceleration amplitude at high frequencies. 

Problem 3.6 

Prove that the exact amplitude at the displacement resonance of a driven mechanical oscillator may 
be written x = Fo/u'r where F o is the driving force amplitude and 

a s >' 2 

w = — ~ 

m 4m ~ 



Problem 3.7 

In a forced mechanical oscillator show that the following are frequency independent (a) the 
displacement amplitude at low frequencies (b) the velocity amplitude at velocity resonance and (c) 
the acceleration amplitude at high frequencies, (w — > oo). 
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Problem 3.8 

In Figure 3.9 show that for small r, the maximum value of curve (a) is ~ Fo/lujor at 
u>i = luq — r/2m and its minimum value is ~ —Fo/2uior at uii = u>o + r/2m. 

Problem 3.9 

The equation ir + w 5 x= (— eEo/m) cos ivt describes the motion of a bound undamped electric 
charge — e of mass m under the influence of an alternating electric field E = Eq cos cot. For an 
electron number density n show that the induced polarizability per unit volume (the dynamic 
susceptibility) of a medium 



nex ne 2 

Xe £ 0 E £ 0 l?l(u>l - LO 2 ) 

(The permittivity of a medium is defined as e = £o( 1 + x) where eq is the permittivity of free space. 
The relative permittivity £ r = e/eo is called the dielectric constant and is the square of the refractive 
index when E is the electric field of an electromagnetic wave.) 

Problem 3.10 

Repeat Problem 3.9 for the case of a damped oscillatory electron, by taking the displacement x as the 
component represented by curve (a) in Figure 3.9 to show that 

£r _ i + x=i + — — 

£q [m 2 (u^ — uj 2 ) 2 + u) 2 r 2 ] 

In fact. Figure 3.9(a) plots £ r = e/eq. Note that for 



LU<^LU 0, £ r 



1 + 



Eoinul 



and for 



0 , Sr 



1 - 



£o mui" 



Problem 3.11 

Show that the energy dissipated per cycle by the frictional force rx at an angular frequency u> is given 

by nru>x 2 mm . 



Problem 3.12 

Show that the bandwidth of the resonance absorption curve defines the phase angle range 
tan <j) = ± 1 . 



Problem 3.13 

An alternating voltage, amplitude Vq is applied across an LCR series circuit. Show that the voltage at 
current resonance across either the inductance or the condenser is QV q. 
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Problem 3.14 

Show that in a resonant LCR series circuit the maximum potential across the condenser occurs at a 
frequency u> = wo(l — 1/2<2 q) where Wq = ( LC ) _1 and go = uqL/R. 

Problem 3.15 

In Problem 3.14 show that the maximum potential across the inductance occurs at a frequency 

W = W 0 (l - 1/2G§) -!/2 

Problem 3.16 

Light of wavelength 0.6 pm (6000 A) is emitted by an electron in an atom behaving as a lightly 
damped simple harmonic oscillator with a g- value of 5 x 10 7 . Show from the resonance bandwidth 
that the width of the spectral line from such an atom is 1.2 x 10 ~ u m. 

Problem 3.17 

If the g-value of Problem 3.6 is high show that the width of the displacement resonance curve is 
approximately \/3 r/m where the width is measured between those frequencies where x = x max /2. 

Problem 3.18 

Show that, in Problem 3.10, the mean rate of energy absorption per unit volume; that is, the power 
supplied is 



ne 2 El u> 2 r 

2 m 2 (wQ-w 2 ) 2 + w 2 r 2 



Summary of Important Results 

Mechanical Impedance Z,„ = F/v (force per unit velocity) 

Z m = Z m e 1 ^ = r + i {cum — s/u>) 



where Z 2 — r 2 + (c om - s/uS) 2 

ojm — s/uj 



sin 0 = 



cos 0 = 



tang = 



MM — s/oj 



is the phase angle between the force and velocity. 



Forced Oscillator 

Equation of motion mx + rx + sx = F o cos cut 
(Vector form) mx + rx + sx = F o e luJ! 

Use x = Ae M to give steady state displacement 

x = -i— e i(w; -^ 
UjZm 
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and velocity 



i = v = — 
7 

z-j m 



When Foe 1UJt represents Fq cos cjt 



x = — — sin (cut — d>) 
OjZ m 

V = — — cos (ujt — 4>) 



Maximum velocity = — at velocity resonant frequency cuo = (s/m ) 12 

Maximum displacement where u>' = (s/m — r 1 /Am 1 ) l,/2 at 

u>'r 

resonant frequency to = ( s/m — r 2 /2m 1 ) 



displacement 



Power Absorbed by Oscillator from Driving Force 

Oscillator adjusts amplitude and phase so that power supplied equals power dissipated. 
Power absorbed = j ( F^/Z ,„) cos <}> (cos cj> is power factor) 

F 2 

Maximum power absorbed = — at tuo 

2 r 

Maxmium power Ft, r r 

absorbed = — at cui = can and tu? = wn H 

2 4r 2m 2m 

Quality factor Q = = — — — 

r L 02 ~ Wi 

maximum displacement at displacement resonance 
displacement as u> — > 0 

A (max) 

F a /s 



For equivalent expressions for electrical oscillators replace m by L, r by R, s by 1/C and F o 
by Vo (voltage). 

f\0AJ(03 

7:02 !•■». 7 / 5/05 
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Coupled Oscillations 



The preceding chapters have shown in some detail how a single vibrating system will 
behave. Oscillators, however, rarely exist in complete isolation; wave motion owes its 
existence to neighbouring vibrating systems which are able to transmit their energy to each 
other. 

Such energy transfer takes place, in general, because two oscillators share a common 
component, capacitance or stiffness, inductance or mass, or resistance. Resistance coupling 
inevitably brings energy loss and a rapid decay in the vibration, but coupling by either of 
the other two parameters consumes no power, and continuous energy transfer over many 
oscillators is possible. This is the basis of wave motion. 

We shall investigate first a mechanical example of stiffness coupling between two 
pendulums. Two atoms set in a crystal lattice experience a mutual coupling force and 
would be amenable to a similar treatment. Then we investigate an example of mass, or 
inductive, coupling, and finally we consider the coupled motion of an extended array of 
oscillators which leads us naturally into a discussion on wave motion. 

Stiffness (or Capacitance) Coupled Oscillators 

Figure 4.1 shows two identical pendulums, each having a mass m suspended on a light rigid 
rod of length /. The masses are connected by a light spring of stiffness s whose natural 
length equals the distance between the masses when neither is displaced from equilibrium. 
The small oscillations we discuss are restricted to the plane of the paper. 

If x and y are the respective displacements of the masses, then the equations of 
motion are 

mix = —mg- — s(x — y) 

and 

my = —mg- + s(x — y) 
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cppmvmb 



Figure 4.1 Two identical pendulums, each a light rigid rod of length / supporting a mass m and 
coupled by a weightless spring of stiffness s and of natural Length equal to the separation of the 
masses at zero displacement 



These represent the normal simple harmonic motion terms of each pendulum plus a coup- 
ling term s(x — y) from the spring. We see that if x > y the spring is extended beyond its 
normal length and will act against the acceleration of x but in favour of the acceleration of y. 
Writing cuq = g/l, where _j () is the natural vibration frequency of each pendulum, gives 

X + ujIx = - — (x - v) (4.1) 

m 

y + uj 2 0 y = --{y-x) (4.2) 

m 

Instead of solving these equations directly for x and y we are going to choose two new 
coordinates 



X — x 4 - y 
Y = x — y 

The importance of this approach will emerge as this chapter proceeds. Adding equations 
(4.1) and (4.2) gives 



x + y + 0 l> 1 (x + y) = 0 



that is 



X + uj 2 X = 0 



and subtracting (4.2) from (4.1) gives 

Y + (cuq + 2s/m)Y = 0 

The motion of the coupled system is thus described in terms of the two coordinates X and K 
each of which has an equation of motion which is simple harmonic. 

If Y — 0, x — y at all times, so that the motion is completely described by the equation 

X + ut 2 0 X = 0 

then the frequency of oscillation is the same as that of either pendulum in isolation and the 
stiffness of the coupling has no effect. This is because both pendulums are always swinging 
in phase (Figure 4.2a) and the light spring is always at its natural length. 
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Figure 4.2 (a) The 'in phase' mode of vibration given by X + uj^X = 0, where X is the normal 

coordinate X = x +y and = g/l. (b) 'Out of phase' mode of vibration given by / + (u>q + 2 s/m) 
where Y is the normal coordinate Y = x — y 



If X = 0, x = — y at all times, so that the motion is completely described by 

Y + (wq + 2s/m)Y = 0 

The frequency of oscillation is greater because the pendulums are always out of phase 
(Figure 4.2b) so that the spring is either extended or compressed and the coupling is 
effective. 

Normal Coordinates, Degrees of Freedom and Normal Modes 
of Vibration 

The significance of choosing X and Y to describe the motion is that these parameters give a 
very simple illustration of normal coordinates. 

• Normal coordinates are coordinates in which the equations of motion take the form of a 
set of linear differential equations with constant coefficients in which each equation 
contains only one dependent variable (our simple harmonic equations in X and T). 

• A vibration involving only one dependent variable X (or Y) is called a normal mode of 
vibration and has its own normal frequency. In such a normal mode all components of 
the system oscillate with the same normal frequency. 

• The total energy of an undamped system may be expressed as a sum of the squares of 
the normal coordinates multiplied by constant coefficients and a sum of the squares of 
the first time derivatives of the coordinates multiplied by constant coefficients. The 
energy of a coupled system when the X and Y modes are both vibrating would then be 
expressed in terms of the squares of the velocities and displacements of X and Y. 

• The importance of the normal modes of vibration is that they are entirely independent 
of each other. The energy associated with a normal mode is never exchanged with 
another mode; this is why we can add the energies of the separate modes to give the 
total energy. If only one mode vibrates the second mode of our system will always be at 
rest, acquiring no energy from the vibrating mode. 

• Each independent way by which a system may acquire energy is called a degree of 
freedom to which is assigned its own particular normal coordinate. The number of such 
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different ways in which the system can take up energy defines its number of degrees of 
freedom and its number of normal coordinates. Each harmonic oscillator has two 
degrees of freedom, it may take up both potential energy (normal coordinate X) and 
kinetic energy (normal coordinate X). In our two normal modes the energies may be 
written 



E x = aX 2 + bX 2 

and 

E y = cY 2 +dY 2 



where a , b, c and d are constant. 



(4.3a) 



(4.3b) 



Our system of two coupled pendulums has, then, four degrees of freedom and four 
normal coordinates. 

Any configuration of our coupled system may be represented by the super-position of the 
two normal modes 



X = x + y = Xq cos (wj t + 4>i) 



and 



Y — x - y = To cos (ca 2 t + fa) 

where Xq and Yq are the normal mode amplitudes, whilst uj\ = g/l and uj\ = (g/l + Is/ m) 
are the normal mode frequencies. To simplify the discussion let us choose 



Xq — Yq = 2 a 



and put 



<t> i — 4> 2 — 0 

The pendulum displacements are then given by 

x = \ (X + Y) = ucoscuil + acosu> 2 t 



and 



y = /(X — Y) = acosuqf — acosu^f 



with velocities 



x = —a lji sinu>] / — au >2 sino^ 



y = —acoi sintuif + au > 2 sin u> 2 t 



and 
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y = 0 ► 2 a < ► a < *■ a < >-a < *■ a < 



Figure 4.3 The displacement of one pendulum by an amount 2 a is shown as the combination of the 
two normal coordinates X + Y 

Now let us set the system in motion by displacing the right hand mass a distance x = 2 a 
and releasing both masses from rest so that x. — y = 0 at time t = 0. 

Figure 4.3 shows that our initial displacement x = 2a, y — 0 at t = 0 may be seen as a 
combination of the ‘in phase’ mode (x = y = a so that x + y = Xq = 2a) and of the ‘out of 
phase’ mode (x = — y = a so that Yq = 2a). After release, the motion of the right hand 
pendulum is given by 



x = rzcoscuif + acostV 2 t 

(u>2 — Wi)t (u>\+U>2)t 

= 2 a cos cos 

2 2 



and that of the left hand pendulum is given by 



y = acoscujf — acosu> 2 t 

. (u>i~ Wi)l . (ui\+UJ2)t 

= —2a sin sin 

2 2 

. (u >2 ~ u\)t . (at i + t o 2 )t 

= 2 a sin sin 

2 2 



If we plot the behaviour of the individual masses by showing how x and y change with time 
(Figure 4.4), we see that after drawing the first mass aside a distance 2 a and releasing it x 
follows a consinusoidal behaviour at a frequency which is the average of the two normal 
mode frequencies, but its amplitude varies cosinusoidally with a low frequency which is 
half the difference between the normal mode frequencies. On the other hand, y, which 
started at zero, vibrates sinusoidally with the average frequency but its amplitude builds up 
to 2 a and then decays sinusoidally at the low frequency of half the difference between the 
normal mode frequencies. In short, the y displacement mass acquires all the energy of the x 
displacement mass which is stationary when y is vibrating with amplitude 2a, but the 
energy is then returned to the mass originally displaced. This complete energy exchange is 
only possible when the masses are identical and the ratio (yj\ + 0 J 2 ) /(loi -wi) is an 
integer, otherwise neither will ever be quite stationary. The slow variation of amplitude at 
half the normal mode frequency difference is the phenomenon of ‘beats’ which occurs 
between two oscillations of nearly equal frequencies. We shall discuss this further in the 
section on wave groups in Chapter 5. 
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Figure 4.4 Behaviour with time of individual pendulums, showing compLete energy exchange 
between the pendulums as x decreases from 2 a to zero whilst y grows from zero to 2 a 



The important point to recognize, however, is that although the individual pendulums 
may exchange energy, there is no energy exchange between the normal modes. Figure 4.3 
showed the initial configuration x = 2 a, y — 0, decomposed into the X and Y modes. The 
higher frequency of the Y mode ensures that after a number of oscillations the Y mode will 
have gained half a vibration (a phase of 7 r rad) on the X mode; this is shown in Figure 4.5. 
The combination of the X and Y modes then gives y the value of 2 a and x = 0, and the 
process is repeated. When Y gains another half vibration then x equals 2a again. The 
pendulums may exchange energy; the normal modes do not. 

To reinforce the importance of normal modes and their coordinates let us return to 
equations (4.3a) and (4.3b). If we modify our normal coordinates to read 



Xq — 




and Y q 





y) 




X 




Y 




Figure 4.5 The faster vibration of the Y mode results in a phase gain of 7r rad over the X mode of 
vibration, to givey = 2 a, which is shown here as a combination of the normal modes X — Y 
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then we find that the kinetic energy in those equations becomes 

Ek = T = aX 2 + cY 2 = l -X 2 q + l -Yl 



and the potential energy 



v = 6x 2 + rfy 2 = i(|)xj + f 0 

_ 1 ,2 v 2 I 1 ,2y2 

- 2 UJ 0 X i, + 




(4.4a) 



(4.4b) 



where — g/Z and cu 2 = g/l + 2 s/m. 

Note that the coefficients of X 2 and Y 2 depend only on the mode frequencies and that the 
properties of individual parts of the system are no longer explicit. 

The total energy of the system is the sum of the energies of each separate excited mode 
for there are no cross products X q Y q in the energy expression of our example, i.e., 



E = T + V = 



fxi 



, ,2 y 2 

2, UJ ° Xq 





Atoms in polyatomic molecules behave as the masses of our pendulums; the normal 
modes of two triatomic molecules CO 2 and H 2 O are shown with their frequencies in 
Figure 4.6. Normal modes and their vibrations will occur frequently throughout this book. 



C0 2 

<-©-»- <-©->• 












co 1 =4.16 x 10 13 sec 1 



co 2 = 7.05 x 10 13 sec 1 



co 3 = 2 x 10 13 sec 1 




Figure 4.6 Normal modes of vibration for triatomic molecules CO 2 and H 2 0 
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The General Method for Finding Normal Mode Frequencies, 
Matrices, Eigenvectors and Eigenvalues 

We have just seen that when a coupled system oscillates in a single normal mode each 
component of the system will vibrate with frequency of that mode. This allows us to adopt 
a method which will always yield the values of the normal mode frequencies and the 
relative amplitudes of the individual oscillators at each frequency. 

Suppose that our system of coupled pendulums in the last section oscillates in only one 
of its normal modes of frequency uj. 

Then, in the equations of motion 

nix + mg(x/ 1) + s{x — y) = 0 

and 

my + mg(y/l ) — s(x — y) — 0 

If the pendulums start from test, we may assume the solutions 

x = Ae iu * 
y = B e iuJf 

where A and B are the displacement amplitudes of x and v at the frequency lo. Using these 
solutions, the equations of motion become 

[— moo 2 A + ( mg/l)A + si A — 6)1 e luJf = 0 

(4.5) 

[— muj 2 B + ( mg/l)B — s(A — B)] e lut — 0 



The sum of these expressions gives 

(A + 6)(— muo 2 + nig /l) — 0 

which is satisfied when lo 2 = g/l, the first normal mode frequency. The difference between 
the expressions gives 

(A — B)(—mu> 2 + nig/l + 2s) = 0 

which is satisfied when io 2 = g/l + 2s/ m, the second normal mode frequency. 

Inserting the value u) 2 = g/l in the pair of equations gives A — B (the ‘in phase’ 
condition), whilst lo 2 = g/l + 2 s/m gives A = —6 (the antiphase conditon). 

These are the results we found in the previous section. 

We may, however, by dividing through by m e rewrite equation (4.5) in matrix form as 



U>1+UJ 2 -UJ 2 I r A I = 2 TA 

-tu? + [B 

where 



(4.6) 




Til 
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This is called an eigenvalue equation. The value of u> 2 for which non-zero solutions exist 
are called the eigenvalues of the matrix. The column vector with components A and B is an 
eigenvector of the matrix. 

Equation (4.6) may be written in the alternative form 



(ujI + uj 2 - (V 2 ) 



—U) c 



-v 2 s A 
(u> 1 + u> 2 - u> 2 ) \ [flj 



= 0 



(4.7) 



and these equations have a non-zero solution if and only if the determinant of the matrix 
vanishes; that is, if 

(< '4+“ 2 s-“ 2 ) 2 -“ 4 s = o 

or 

(cUy + UJ 2 S ~ UJ 2 ) = ±UJ 2 S 

i.e. 

cUj = or — ijj 2 0 + 2 uj 2 

as we expect. 

The solution t o\ = cuy in equation (4.6) yields A = B as previously and uj 2 = cuy + 2 uj 2 
yields A = —B. 

Because the system started from rest we have been able to assume solutions of the 
simple form 

x = a e ut 

y = Be“' 

When the pendulums have an initial velocity at t — 0, the boundary conditions require 
solutions of the form 

x = Ae i(uJ,+Qi) 

y = 5e i ( UJ,+av ) 



where each normal mode frequency uj has its own particular value of the phase constant a . 
The number of adjustable constants then allows the solutions to satisfy the arbitrary values 
of the initial displacements and velocities of both pendulums. 



(Problems 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 4.10, 4.11) 



Mass or Inductance Coupling 

In a later chapter we shall discuss the propagation of voltage and current waves along a 
transmission line which may be considered as a series of coupled electrical oscillators 
having identical values of inductance and of capacitance. For the moment we shall consider 
the energy transfer between two electrical circuits which are inductively coupled. 
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A mutual inductance (shared mass) exists between two electrical circuits when the 
magnetic flux from the current flowing on one circuit threads the second circuit. Any 
change of flux induces a voltage in both circuits. 

A transformer depends upon mutual inductance for its operation. The power source is 
connected to the transformer primary coil of n p turns, over which is wound in the same 
sense a secondary coil of n s turns. If unit current flowing in a single turn of the primary coil 
produces a magnetic flux <f>, then the flux threading each primary turn (assuming no flux 
leakage outside the coil) is n p (f) and the total flux threading all n p turns of the primary is 



L p = 




where L p is the self inductance of the primary coil. If unit current in a single turn of the 
secondary coil produces a flux f, then the flux threading each secondary turn is n s <f> and the 
total flux threading the secondary coil is 



L s 




where L s is the self inductance of the secondary coil. 

If all the flux lines from unit current in the primary thread all the turns of the secondary, 
then the total flux lines threading the secondary defines the mutual inductance 

M n s (n p <t>) \/L p L v 

In practice, because of flux leakage outside the coils, M < ^ jL p L s and the ratio 

— = k , the coefficient of coupling. 

yJL p L s 

If the primary current I p varies with e luJt , a change of I p gives an induced voltage 
—Lpdlp/ d t = —iioLIp in the primary and an induced voltage —Mdl p /dt = —it oMl p in the 
secondary. 

If we consider now the two resistance-free circuits of Figure 4.7, where L\ and Li are 
coupled by flux linkage and allowed to oscillate at some frequency uj (the voltage and 
current frequency of both circuits), then the voltage equations are 

iojLJi — i— I\ + iu>MI 2 — 0 (4.8) 

UtC i 





M 



Figure 4.7 Inductively (mass) coupled LC circuits with mutual inductance M 
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and 



iujL 2 I 2 ~ 

where M is the mutual inductance. 
Multiplying (4.8) by uj/\ L\ gives 

uj 2 I\ - 

and multiplying (4.9) by uj/iL 2 gives 

co 2 I 2 ~ 



i 1 2 iioMI \ — 0 

ujC 2 



h 

LiC x 



M 

Ll 



uj 2 I 2 = 0 



h 

l 2 C 2 



M ? 

—uj 1 1 = 0, 

L 2 



(4.9) 



where the natural frequencies of the circuit cu 2 = 1 / L\C\ and lo\ = I/L 2 C 2 give 

(lo 2 -lo 2 )I 1 =^-co 2 I 2 (4.10) 

L \ 

and 

{LU 2 -W 2 )l2 = M-U 2 h (4.11) 

c 2 

The product of equations (4.10) and (4.11) gives 

M 2 

(uj 2 - ui 2 )(u } 2 - uj 2 ) = — — u > 4 = k 2 ui 4 , (4.12) 

L\L 2 

where k is the coefficient of coupling. 

Solving for ut gives the frequencies at which energy exchange between the circuits 
allows the circuits to resonate. If the circuits have equal natural frequencies uq = <v 2 — tuo, 
say, then equation (4.12) becomes 



or 



(2 2\2 ,24 

(LOq — U) ) = k U) 

(u>0 - w 2 ) = ±ku 2 



that is 



uj = 



± 



UJ 0 

y/l zh k 



The positive sign gives two frequencies 



UJ 



/ 



UJ 0 

VTTk 



A ,f 

and uj = - . 

y/l-k 



at which, if we plot the current amplitude versus frequency, two maxima appear (Figure 4.8). 
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Figure 4.8 Variation of the current amplitude in each circuit near the resonant frequency. A smaLl 
resistance prevents the amplitude at resonance from reaching infinite values but this has been 
ignored in the simple analysis. Flattening of the response curve maximum gives 'frequency band pass' 
coupling 



In loose coupling k and M are small, and u>' « to" « loq, so that both systems behave 
almost independently. In tight coupling the frequency difference t o" — uj' increases, the 
peak values of current are displaced and the dip between the peaks is more pronounced. In 
this simple analysis the effect of resistance has been ignored. In practice some resistance is 
always present to limit the amplitude maximum. 

(Problems 4.12, 4.13, 4.14, 4.15, 4.16) 



Coupled Oscillations of a Loaded String 

As a final example involving a large number of coupled oscillators we shall consider a light 
string supporting n equal masses m spaced at equal distance a along its length. The string is 
fixed at both ends; it has a length (n + 1 )a and a constant tension T exists at all points and 
all times in the string. 

Small simple harmonic oscillations of the masses are allowed in only one plane and the 
problem is to find the frequencies of the normal modes and the displacement of each mass 
in a particular normal mode. 

This problem was first treated by Lagrange, its particular interest being the use it makes 
of normal modes and the light it throws upon the wave motion and vibration of a 
continuous string to which it approximates as the linear separation and the magnitude of the 
masses are progressively reduced. 

Figure 4.9 shows the displacement y r of the r th mass together with those of its two 
neighbours. The equation of motion of this mass may be written by considering the 
components of the tension directed towards the equilibrium position. The r th mass is 
pulled downwards towards the equilibrium position by a force T sin 6 [ , due to the tension 
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YrYr - 1 m Yr-Yr+1 




Figure 4.9 Displacements of three masses on a loaded string under tension T giving equation of 
motion my r = T(y r+1 - 2 y r + y r -i)/a 



on its left and a force T sin 6 2 due to the tension on its right where 

. y r -y r - 1 

sin 0 1 = 



and 



sin O2 — 



y r -y r + 1 

a 



Hence the equation of motion is given by 

d 2 y 



, 2 = -T (sin 6 1 + sin 0 2 ) 



= -T 



f y r y r— 1 y r -y r +A 
V a a ) 



so 



d ^r = yr = — (yr-i-2y r +y r+ i) ( 4 . 13 ) 

d t z ma 

If, in a normal mode of oscillation of frequency u>, the time variation of y r is simple 
harmonic about the equilibrium axis, we may write the displacement of the r th mass in this 
mode as 



y r — A r e lw/ 

where A r is the maximum displacement. Similarly y r +i = A r+ \ e ljJ ' and y i = A r - 1 e . 
Using these values of y in the equation of motion gives 

-uj 2 A r e lut — — (A,._ 1 — 2 A r + A r+ i)e lwr 
ma 



or 



—A r - 1 




mauj ' 



A r — A r+ 1 



= 0 



( 4 . 14 ) 



This is the fundamental equation. 
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The procedure now is to start with the first mass r = 1 and move along the string, writing 
out the set of similar equations as r assumes the values r = 1,2,3, ... ,n remembering that, 
because the ends are fixed 



y 0 = A 0 = 0 and y n+1 = A„ +1 = 0 



Thus, when r = 1 the equation becomes 

( tnauj 2 \ 

(2-—j—)A l -A 2 =0 (Aq = 0) 

When r — 2 we have 



-A, 




mau> 



At — A3 — 0 



and when r = n we have 



An— 1 




0 (A n+ 1 — 0) 



Thus, we have a set of n equations which, when solved, will yield n different values of W, 
each value of lo being the frequency of a normal mode, the number of normal modes being 
equal to the number of masses. 

The formal solution of this set of n equations involves the theory of matrices. However, 
we may easily solve the simple cases for one or two masses on the string (n — I or 2) and, 
in additon, it is possible to show what the complete solution for n masses must be without 
using sophisticated mathematics. 

First, when n — 1 , one mass on a string of length 2a, we need only the equation for r = 1 
where the fixed ends of the string give Aq — A 2 — 0. 

Hence we have 




maw 



A 1 = 0 



giving 

, 2 T 

w = — 
ma 

a single allowed frequency of vibration (Figure 4.10a). 

When n = 2, string length 3 a (Figure 4.10b) we need the equations for both r — 1 and 
r = 2; that is 




-a 2 = o 
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m 




(a) 



n = 1 



2 



(0 



27 

ma 




Figure 4.10 (a) Normal vibration of a single mass m on a string of length 2 a at a frequency 

u! 2 = 2T/ma. (b) Normal vibrations of two masses on a string of length 3 a showing the loose coupled 
'in phase' mode of frequency cj 2 = T /ma and the tighter coupled 'out of phase' mode of frequency 
= 3 T /ma. The number of normal modes of vibration equals the number of masses 



and 



- A|+ ( 2 -=!) A2 = 0 (A 0 =A,= 0) 

Eliminating A ] or At shows that these two equations may be solved (are consistent) 
when 



that is 



mauj‘ 



2 

- 1=0 




maaj 2 

T 




mauj 2 

T 




= 0 



Thus, there are two normal mode frequencies 



LO 



2 

1 



T 



3 T 



ma 



and 



ma 




94 



Coupled Oscillations 



Using the values of uq in the equations for r = 1 and r — 2 gives A i = A 2 the slow ‘in 
phase’ oscillation of Figure 4.10b, whereas u >2 gives A\ = — Ai the faster ‘anti-phase’ 
oscillation resulting from the increased coupling. 

To find the general solution for any value of n let us rewrite the equation 



—A r - 1 




mauj‘ 



A, — A r+ 1 



- 0 



in the form 



A r -\ + A r+ \ — — u> 2 

A ,. u)q 



where 



tUn = 



T 

ma 



We see that for any particular fixed value of the normal mode frequency Lu(u>j say) the 
right hand side of this equation is constant, independent of r, so the equation holds for all 
values of r. What values can we give to A which will satisfy this equation, meeting the 
boundary conditions Ao = A „ + 1 = 0 at the end of the string? 

Let us assume that we may express the amplitude of the rth mass at the frequency loj as 

A,. = Ce ir0 



where C is a constant and 9 is some constant angle for a given value of ojj. The left hand 
side of the equation then becomes 

Ar-i +A r+1 _ Cte^ + e^ 1 )*) _ i0 w 
A r _ CO'-o j 

= 2 cos 6 



which is constant and independent of r. 

The value of 9j (constant at uj) is easily found from the boundary conditions 

^0 = A n+ \ = 0 



which, using sin r9 from c" 0 gives 



Aq = C sin rO = 0 (automatically at r = 0) 



and 



A n+ i = C sin (n + 1)9 — 0 



when 



(n + 1) 9j — jit for j — 1,2, ... ,n 
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Hence 



Oj = 



n + 1 



and 



A r = C sin r9j = C sin 



rjn 
n + 1 



which is the amplitude of the rth mass at the fixed normal mode frequency luj. 
To hnd the allowed values of ujj we write 



A r - 1 + A r+ i 

T r 



2 cuq — ujj 



OJt 



2 cos Oj — 2 cos 



J7T 

n + 1 



giving 



2 o 2 

iOj =2 w 0 



— COS 



n + 1 



(4.15) 



where j may take the values j = 1,2 and = T /ma. 

Note that there is a maximum frequency of oscillation ujj = 2 ojq. This is called the ‘cut 
off’ frequency and such an upper frequency limit is characteristic of all oscillating systems 
composed of similar elements (the masses) repeated periodically throughout the structure 
of the system. We shall meet this in the next chapter as a feature of wave propagation in 
crystals. 

To summarize, we have found the normal modes of oscillation of n coupled masses on 
the string to have frequencies given by 



-J = 



2 T 



1 — cos 



J 71 " 



(j 



1,2,3 ...n) 



At each frequency ujj the r th mass has an amplitude 



A r = C sin 



rjtt 
n + 1 



where C is a constant. 

(Problems 4.17, 4.18, 4.19, 4.20, 4.21, 4.22) 

The Wave Equation 

Finally, in this chapter, we show how the coupled vibrations in the periodic structure of our 
loaded string become waves in a continuous medium. 
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We found the equation of motion of the r th mass to be 

^2 y 

= Cvr+i — + Jr-t) (4.13) 

at 1 ma 

We know also that in a given normal mode all masses oscillate with the same mode 
frequency lu, so all y r ’s have the same time dependence. However, as we see in Fig- 
ure 4.10(b) where A i and Ai are anti-phase, the transverse displacement y r also depends 
upon the value of r; that is, the position of the r th mass on the string. In other words, y r is a 
function of two independent variables, the time t and the location of r on the string. 

If we use the separation a « 8x and let 8x — » 0, the masses become closer and we can 
consider positions along the string in terms of a continuous variable x and any transverse 
displacement as y(x, t), a function of both x and t. 

The partial derivative notation dy(x,t)/dt expresses the variation with time of y(x,t ) 
while x is kept constant. 

The partial derivative dy(x, t ) /Ox expresses the variation with x of y(x, t) while the time t 
is kept constant. (Chapter 5 begins with an extended review of this process for students 
unfamiliar with this notation.) 

In the same way, the second derivative d 2 y{x, t) jdt 2 continues to keep x constant and 
d 2 y(x,t)/dx 2 keeps t constant. 

For example, if 



y = eK^+kx) 



then 





dy 


i Loe i{ut+kx) 


= i ujy 


and 


d 2 y 

dt 2 


while 


dy 


: ike i{ut+kx) 


1 


and 


d 2 y 




dx 


— i/cy 


dx 2 



If we now locate the transverse displacement y r at a position x = x r along the string, 
then the left hand side of equation (4.13) becomes 

d 2 y r d 2 y 

where y is evaluated at x = x r and now, as a — 8x — 0, we may write x r = x,x r+ \ — 
x + 8x and x r ^i—x — 8x with y r {t) —* y(x, t),y r+ \(t) — > y(x + 8x, t) and y r _i(f) — > 
y(x — 8x, t). 

Using a Taylor series expansion to express y(x ± 6x, t) in terms of partial derivates of y 
with respect to x we have 



y(x ± 8x, t ) — y(x) ± 8x 



dy 

dx 



V ±6x) 



2 fry 

dx 2 
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and equation (4.13) becomes after substitution 



d 2 y _ T_ / y r+ 1 — y r _ y r -y r - 1 
dt 2 m\ a a 



fc - d y , 1 



i d 2 y 



dy 1 



d 2 y \ 



& S + 2< & > 5? 



m 









so 



d 2 y T (Sx) 2 d 2 y T d 2 ? 

dt 2 m 8x dx 2 m X dx 2 



If we now write m — p8 x where p is the linear density (mass per unit length) of the 
string, the masses must — 4) as 8x — -0 to avoid infinite mass density. Thus, we have 



d 2 y T d 2 y 
dt 2 p dx 2 



This is the Wave Equation. 

T / p has the dimensions of the square of a velocity, the velocity with which the waves; 
that is, the phase of oscillation, is propagated. The solution for y at any particular point 
along the string is always that of a harmonic oscillation. 



(Problem 4.23) 



Problem 4.1 

Show that the choice of new normal coordinates X q and Y q expresses equations (4.3a) and (4.3b) as 
equations (4.4a) and (4.4b). 

Problem 4.2 

Express the total energy of Problem 4. 1 in terms of the pendulum displacements x and y as 

E = (fi'kin T Epof') x + (Elan T Ep 0 t) y + (Epot)^, 

where the brackets give the energy of each pendulum expressed in its own coordinates and (E pot ) 
is the coupling or interchange energy involving the product of these coordinates. 

Problem 4.3 

Figures 4.3 and 4.5 show how the pendulum configurations x = 2a, y = 0 and x = 0, y = 2a result 
from the superposition of the normal modes X and Y. Using the same initial conditions 
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(x = 2a, y = 0, x = y = 0) draw similar sketches to show how X and Y superpose to produce 
x = —2 a,y = 0 and x = 0, _y = —2a. 

Problem 4.4 

In the figure two masses m \ and m 2 are coupled by a spring of stiffness s and natural length /. If x is 
the extension of the spring show that equations of motion along the x axis are 



m 1 X 1 = sx 



and 



1112 X 2 = — sx 



and combine these to show that the system oscillates with a frequency 




where 



m imj 
in 1+7/7 2 



is called the reduced mass. 

The figure now represents a diatomic molecule as a harmonic oscillator with an effective mass 
equal to its reduced mass. If a sodium chloride molecule has a natural vibration frequency 
= 1.14 x 10 13 Hz (in the infrared region of the electromagnetic spectrum) show that the interatomic 
force constant s = 120 Nm~' (this simple model gives a higher value for .? than more refined 
methods which account for other interactions within the salt crystal lattice) 

Mass of Na atom = 23 a.m.u. 

Mass of Cl atom = 35 a.m.u. 

1 a.m.u. = 1.67xl0“ 27 kg 



> / < 




x 2 



Problem 4.5 

The equal masses in the figure oscillate in the vertical direction. Show that the frequencies of the 
normal modes of oscillation are given by 

co 2 = (3 ± y/S) — S — 

2 777 

and that in the slower mode the ratio of the amplitude of the upper mass to that of the lower mass is 
j (\/5 — 1) whilst in the faster mode this ratio is — | (\/5 +1). 
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In the calculations it is not necessary to consider gravitational forces because they play no part in 
the forces responsible for the oscillation. 

Problem 4.6 

In the coupled pendulums of Figure 4.3 let us write the modulated frequency ut m = ( u >2 — u>\)/2 and 
the average frequency u> a = {u >2 + u>\)/2 and assume that the spring is so weak that it stores a 
negligible amount of energy. Let the modulated amplitude 

2acosuj m t or 2asmui m t 

be constant over one cycle at the average frequency c o a to show that the energies of the masses may 
be written 

E x = 2ma 2 uj 2 cos 2 ui m t 

and 

E y = 2 ma 2 u 2 sin 2 u> m t 

Show that the total energy E remains constant and that the energy difference at any time is 

E x — E y = E cos («2 —u>i )t 

Prove that 

E 

£ x = -[l+cos (u> 2 -wi)f] 

and 

£ 

Ey = -[1 - cos (w 2 - Wi)f] 

to show that the constant total energy is completely exchanged between the two pendulums at the 
beat frequency (u >2 — Wi). 



Problem 4.7 

When the masses of the coupled pendulums of Figure 4.1 are no longer equal the equations of 
motion become 



m \X = —m 1 ( g/l)x — s(x — y) 
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and 

m 2: y = ~m 2 {g/t)y + ■?(■* - y) 

Show that we may choose the normal coordinates 

_ m\x+ m 2 y 

X. — 

m 1 + m 2 

with a normal mode frequency to\=g/l and Y = x — y with a normal mode frequency 
to 2 = g/l + s(l/mi + l/m 2 ). 

Note that X is the coordinate of the centre of mass of the system whilst the effective mass in the Y 
mode is the reduced mass p of the system where l/fi = l//«i + l//w 2 . 

Problem 4.8 

Let the system of Problem 4.7 be set in motion with the initial conditions x = A,y = 0,jc = y = 0 at 
t = 0. Show that the normal mode amplitudes are Xq = (; m\/M)A and Yq = A to yield 

A / 

x = — [m 1 cos to \t + m 2 cos to 2 t) 

and 

mi 

y = A — (cos to it — cos to 2 t ) , 



where M = m 1 + m 2 - 
Express these displacements as 

„ , 2 A, 

x = 2A cos to m t cos to a t + —(mi — m 2 ) sino; m r siniu fl t 

and 

m 1 

y = 2 A — sinw„,(smw„/, 

M 

where to m = (002 — to i)/2 and to a = (to 1 +to 2 )/2. 

Problem 4.9 

Apply the weak coupling conditions of Problem 4.6 to the system of Problem 4.8 to show that the 
energies 

p 

E x = [m\ + m\ + 2miw 2 cos (to 2 - oo\)t\ 

and 

Ey = m 1 J ^ “ C ° S 

Note that E x varies between a maximum of E(att= 0) and a minimum of [(m \ — m 2 ) /M\ 2 E, whilst 
Ey oscillates between a minimum of zero at t = 0 and a maximum of A(m\m 2 /M 2 )E at the beat 
frequency of (to 2 — wi). 
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Problem 4.10 

In the figure below the right hand pendulum of the coupled system is driven by the horizontal force 
F o cos cut as shown. If a small damping constant r is included the equations of motion may be written 



mg . , . 

x = — — x — rx — s(x — y) + Fq cos 



U!t 



and 

mg , , 

my = — y — ry + s{x — y) 

Show that the equations of motion for the normal coordinates X = x + y and Y = x — y are those 
for damped oscillators driven by a force F o cos cot. 

Solve these equations for X and Y and, by neglecting the effect of r, show that 



x 



Fo 

2m 



cos Ult 




and 



where 



Show that 



Fo \ 
y ~ — cos cot | 
2m 



ICJ? — LO 2 CJ? — C0 2 \ 



2 8 . 2 8 , 2s 

cut = - and cuT = - H 

Z l m 



2 2 
y ^ 

x ui\ + u)j — 2ui 2 



and sketch the behaviour of the oscillator with frequency to show that outside the frequency range 
CU 2 — tui the motion of y is attenuated. 



/ / 
s 

( mnrtnsrin rrsirsinn® < F o cos * 

~y* 



Problem 4.11 

The diagram shows an oscillatory force F „ cos cut acting on a mass M which is part of a simple 
harmonic system of stiffness k and is connected to a mass m by a spring of stiffness s. If all 
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Problem 4.13 

A generator of e.m.f. E is coupled to a load Z by means of an ideal transformer. From the diagram, 
Kirchhoff’s Law gives 



E = —e i = \ujLpI | — iujMIi 



and 



I 2 Z 2 = e 2 = i ujMIi - i ujL s I 2 . 



Show that E/1 i, the impedance of the whole system seen by the generator, is the sum of the primary 
impedance and a ‘reflected impedance’ from the secondary circuit of cu 2 M 2 /Z s where 
Z s — Z 2 A i loE s . 




Problem 4.14 

Show, for the perfect transformer of Problem 4.13, that the impedance seen by the generator consists 
of the primary impedance in parallel with an impedance (n p /n s ) 2 Z 2 , where n p and n s are the 
number of primary and secondary transformer coil turns respectively. 

Problem 4.15 

If the generator delivers maximum power when its load equals its own internal impedance show how 
an ideal transformer may be used as a device to match a load to a generator, e.g. a loudspeaker of a 
few ohms impedance to an amplifier output of 10 3 El impedance. 

Problem 4.16 

The two circuits in the diagram are coupled by a variable mutual inductance M and Kirchhoff’s Law 
gives 



Z\I\ + ZmI 2 = E 



and 



Z M I\ + Z 2 I 2 — 0, 



where 



Zfyi — +i luM 

M is varied at a resonant frequency where the reactance X\ = X 2 = 0 to give a maximum value 
of I 2 . Show that the condition for this maximum is uM = \/R \R 2 and that this defines a 
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‘critical coefficient of coupling’ k = (Q 1 Q 2 ) 1 , where the <2’s are the quality factors of the 
circuits. 




Problem 4.17 

Consider the case when the number of masses on the loaded string of this chapter is n = 3. Use 
equation (4.15) to show that the normal mode frequencies are given by 

u>\ = (2 — V2)u,§; u>\ = 2uq 

and 

W3 = (2 + V 2 )u>q 

Repeat the problem using equation (4.14) (with tug = T /ma) in the matrix method of equation (4.7), 
where the eigenvector components are A r _ 1 , A r and A r+ \. 

Problem 4.18 

Show that the relative displacements of the masses in the modes of Problem 4.17 are 1 : \/2 : 1, 
1 : 0 : —1, and 1 : —\f2 : 1. Show by sketching these relative displacements that tighter coupling 
increases the mode frequency. 

Problem 4.19 



m M m 




hi -► h2 * h3 



The figure represents a triatomic molecule with a heavy atom mass M bound to equal atoms of 
smaller mass m on either side. The binding is represented by springs of stiffness s and in equilibrium 
the atom centres are equally spaced along a straight line. Simple harmonic vibrations are considered 
only along this linear axis and are given by 






where r\j is the displacement from equilibrium of the yth atom. 

Set up the equation of motion for each atom and use the matrix method of equation (4.7) to show 
that the normal modes have frequencies 



Wj = 0, W? 



— and u>\ 
m 



s(M + 2m) 
niM 



Describe the motion of the atoms in each normal mode. 
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Problem 4.20 

Taking the maximum value of 



at j = n as that produced by the strongest coupling, deduce the relative displacements of 
neighbouring masses and confirm your deduction by inserting your values in consecutive difference 
equations relating the displacements y r+ \,y r and y r -\. Why is your solution unlikely to satisfy the 
displacements of those masses near the ends of the string? 



2 T 



1 — cos 



JK 

n + 1 



Problem 4.21 

Expand the value of 



2 T 
ma 



^1 — cos 




when j « n in powers of {j/n+ 1) to show that in the limit of very large values of n, a low 
frequency 





where p = m/ a and l = (n + 1 )a. 



Problem 4.22 

An electrical transmission line consists of equal inductances L and capacitances C arranged as 
shown. Using the equations 

Ldl r —l q r— 1 q r 

dr = Vr ~ l ~ V'' = ~q 

and 



E - 1 



~Ir 



d g r 
d t ’ 



show that an expression for I r may be derived which is equivalent to that for y r in the case of the 
mass-loaded string. (This acts as a low pass electric filter and has a cut-off frequency as in the case of 
the string. This cut-off frequency is a characteristic of wave propagation in periodic structures and 
electromagnetic wave guides.) 




Problem 4.23 

Show that 



satisfies the wave equation 



d 2 y _ 2 9 2 y 
dt 2 dx 2 ’ 



if io = ck 
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Summary of Important Results 

In coupled systems each normal coordinate defines a degree of freedom, each degree of 
freedom defines a way in which a system may take up energy. The total energy of the 
system is the sum of the energies in its normal modes of oscillation because these remain 
separate and distinct, and energy is never exchanged between them. 

A simple harmonic oscillator has two normal coordinates [velocity (or momentum) and 
displacement] and therefore two degrees of freedom, the first connected with kinetic 
energy, the second with potential energy. 



n Equal Masses , Separation a. Coupled on a String under Constant Tension T 
Equation of motion of the rth mass is 

my r = (T/a)(y r - 1 - 2 y r + y r + 1 ) 



which for y r = A r q' u1 gives 



2 — maui 2 



—A r+ i + — )A r —A r i — 0 



There are n normal modes with frequencies uij given by 



7T 

u> 2 , = — I 1 — cos 



n + 1 

In a normal mode of frequency uij the nh mass has an amplitude 

rjn 



A r = C sin 



n + 1 



where C is a constant. 



Wave Equation 

In the limit, as separation a — 6x — > 0 equation of motion of the nh mass on a loaded 
string my r = ( T /a)(y r - i — 2 y r + y r +i) becomes the wave equation 

d 2 y _ T d 2 y _ 2 d 2 y 
dt 2 p dx 2 dx 2 

where p is mass per unit length and c is the wave velocity. 
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Partial Differentiation 

From this chapter onwards we shall often need to use the notation of partial differentiation. 

When we are dealing with a function of only one variable, y = fix) say, we write the 
differential coefficient 



dy = ^ f(x + 6x) -f{x) 
dr 6 x ^0 6x 



but if we consider a function of two or more variables, the value of this function will vary 
with a change in any or all of the variables. For instance, the value of the co-ordinate z on 
the surface of a sphere whose equation is x 2 +y 2 +z 2 = a 2 , where a is the radius of the 
sphere, will depend on x and y so that z is a function of x and y written z = z,(x, y). The 
differential change of z which follows from a change of x and y may be written 



dz 





dy 



where (dz/dx) means differentiating z with respect to x whilst y is kept constant, so that 

f—\= lim z ^ x + 6x ^) ~z(x,y) 

\dx ) &— >o 6x 



The total change dr is found by adding the separate increments due to the change of each 
variable in turn whilst the others are kept constant. In Figure 5.1 we can see that keeping y 
constant isolates a plane which cuts the spherical surface in a curved line, and the 
incremental contribution to dz along this line is exactly as though z were a function of x 
only. Now by keeping x constant we turn the plane through 90° and repeat the process with 
y as a variable so that the total increment of dz is the sum of these two processes. 

If only two independent variables are involved, the subscript showing which variable is 
kept constant is omitted without ambiguity. 
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Figure 5.1 Small element of a Spherical Surface showing dz=dz! + dz 2 = ( dz/dx) y Ax+ 
(dz/dy) x dy where each gradient is calculated with one variable remaining constant 



In wave motion our functions will be those of variables of distance and time, and we 
shall write d/dx and <) 2 j'dx 1 for the first or second derivatives with respect to x, whilst the 
time t remains constant. Again, d/dt and d 2 /dt 2 will denote first and second derivatives 
with respect to time, implying that x is kept constant. 



Waves 

One of the simplest ways to demonstrate wave motion is to take the loose end of a long 
rope which is fixed at the other end and to move the loose end quickly up and down. Crests 
and troughs of the waves move down the rope, and if the rope were infinitely long such 
waves would be called progressive waves-these are waves travelling in an unbounded 
medium free from possible reflection (Figure 5.2). 



crest 




Progressive waves on infinitely long string 
Figure 5.2 Progressive transverse waves moving along a string 
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If the medium is limited in extent; for example, if the rope were reduced to a violin 
string, fixed at both ends, the progressive waves travelling on the string would be reflected 
at both ends; the vibration of the string would then be the combination of such waves 
moving to and fro along the string and standing waves would be formed. 

Waves on strings are transverse waves where the displacements or oscillations in the 
medium are transverse to the direction of wave propagation. When the oscillations are parallel 
to the direction of wave propagation the waves are longitudinal. Sound waves are longitudinal 
waves; a gas can sustain only longitudinal waves because transverse waves require a shear 
force to maintain them. Both transverse and longitudinal waves can travel in a solid. 

In this book we are going to discuss plane waves only. When we see wave motion as a 
series of crests and troughs we are in fact observing the vibrational motion of the individual 
oscillators in the medium, and in particular all of those oscillators in a plane of the medium 
which, at the instant of observation, have the same phase in their vibrations. 

If we take a plane perpendicular to the direction of wave propagation and all oscillators 
lying within that plane have a common phase, we shall observe with time how that plane of 
common phase progresses through the medium. Over such a plane, all parameters 
describing the wave motion remain constant. The crests and troughs are planes of 
maximum amplitude of oscillation which are 7r rad out of phase; a crest is a plane of 
maximum positive amplitude, while a trough is a plane of maximum negative amplitude. In 
formulating such wave motion in mathematical terms we shall have to relate the phase 
difference between any two planes to their physical separation in space. We have, in 
principle, already done this in our discussion on oscillators. 

Spherical waves are waves in which the surfaces of common phase are spheres and the 
source of waves is a central point, e.g. an explosion; each spherical surface defines a set of 
oscillators over which the radiating disturbance has imposed a common phase in vibration. 
In practice, spherical waves become plane waves after travelling a very short distance. A 
small section of a spherical surface is a very close approximation to a plane. 

Velocities in Wave Motion 

At the outset we must be very clear about one point. The individual oscillators which make 
up the medium do not progress through the medium with the waves. Their motion is simple 
harmonic, limited to oscillations, transverse or longitudinal, about their equilibrium 
positions. It is their phase relationships we observe as waves, not their progressive motion 
through the medium. 

There are three velocities in wave motion which are quite distinct although they are 
connected mathematically. They are 

1. The particle velocity, which is the simple harmonic velocity of the oscillator about its 
equilibrium position. 

2. The wave or phase velocity, the velocity with which planes of equal phase, crests or 
troughs, progress through the medium. 

3. The group velocity. A number of waves of different frequencies, wavelengths and 
velocities may be superposed to form a group. Waves rarely occur as single 
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monochromatic components; a white light pulse consists of an infinitely fine spectrum 
of frequencies and the motion of such a pulse would be described by its group velocity. 
Such a group would, of course, ‘disperse’ with time because the wave velocity of each 
component would be different in all media except free space. Only in free space would 
it remain as white light. We shall discuss group velocity as a separate topic in a later 
section of this chapter. Its importance is that it is the velocity with which the energy in 
the wave group is transmitted. For a monochromatic wave the group velocity and the 
wave velocity are identical. Here we shall concentrate on particle and wave velocities. 



The Wave Equation 



This equation will dominate the rest of this text and we shall derive it, first of all, by 
considering the motion of transverse waves on a string. 

We shall consider the vertical displacement y of a very short section of a uniform string. 
This section will perform vertical simple harmonic motions; it is our simple oscillator. The 
displacement y will, of course, vary with the time and also with x, the position along the 
string at which we choose to observe the oscillation. 

The wave equation therefore will relate the displacement y of a single oscillator to 
distance x and time t. We shall consider oscillations only in the plane of the paper, so that 
our transverse waves on the string are plane polarized. 

The mass of the uniform string per unit length or its linear density is p, and a constant 
tension T exists throughout the string although it is slightly extensible. 

This requires us to consider such a short length and such small oscillations that we may 
linearize our equations. The effect of gravity is neglected. 

Thus in Figure 5.3 the forces acting on the curved element of length d.v are T at an angle 9 
to the axis at one end of the element, and T at an angle f) + <A0 at the other end. The length 
of the curved element is 



ds 



I + 




21 X l 2 



dx 




Figure 5.3 Displaced element of string of Length ds « dx with tension T acting at an angle 6 at x 
and at 6 + A9 at x + dx 
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but within the limitations imposed dy/dx is so small that we ignore its square and take 
d.s' = dx. The mass of the element of string is therefore pds = pdx. Its equation of motion is 
found from Newton’s Law, force equals mass times acceleration. 

The perpendicular force on the element dx is T sin (# + d 6) — T sin fj in the positive y 
direction, which equals the product of pdx (mass) and d 2 y/dt 2 (acceleration). 

Since 9 is very small sin# ~ tan# = dy/dx, so that the force is given by 



T 





xJ 



where the subscripts refer to the point at which the partial derivative is evaluated. The 
difference between the two terms in the bracket defines the differential coefficient of the 
partial derivative dy/dx times the space interval dx, so that the force is 



T 



d 2 y 
dx 2 



dx 



The equation of motion of the small element dx then becomes 



<9 2 y 
T — - 
dx 2 



dx = p dr 



d 2 y 

Hi 2 



or 

d 2 y _ p d 2 y 

dii 2 = f~dt 2 

giving 

d 2 y 1 d 2 y 

dx 2 c 2 dt 2 

where T / p has the dimensions of a velocity squared, so c in the preceding equation is a 
velocity. THIS IS THE WAVE EQUATION. 

It relates the acceleration of a simple harmonic oscillator in a medium to the second 
derivative of its displacement with respect to its position, x, in the medium. The position of 
the term c 2 in the equation is always shown by a rapid dimensional analysis. 

So far we have not explicitly stated which velocity c represents. We shall see that it is the 
wave or phase velocity, the velocity with which planes of common phase are propagated. In 
the string the velocity arises as the ratio of the tension to the inertial density of the string. 
We shall see, whatever the waves, that the wave velocity can always be expressed as a 
function of the elasticity or potential energy storing mechanism in the medium and the 
inertia of the medium through which its kinetic or inductive energy is stored. For 
longitudinal waves in a solid the elasticity is measured by Young’s modulus, in a gas by 7 P, 
where 7 is the specific heat ratio and P is the gas pressure. 
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Solution of the Wave Equation 

The solution of the wave equation 



d 2 y 1 d 2 y 

dx 2 c 2 dt 2 



will, of course, be a function of the variables x and t. We are going to show that any 
function of the form y = f\ (ct — x) is a solution. Moreover, any function y = f' 2 ( ct +x) 
will be a solution so that, generally, their superposition y = f \ (ct — x) + f '2 ( ct + x) is the 
complete solution. 

If f[ represents the differentiation of the function with respect to the bracket ( ct — x), 
then using the chain rule which also applies to partial differentiation 



dy 

dx 



- f[(ct-X ) 



and 



also 



and 



d 2 y 
dx 2 



f"{ct-x ) 



- cf((ct - x) 



d 2 y 

W~ 



c 2 f"{ct - x) 



so that 

d 2 y 1 d 2 y 
dx 2 c 2 dt 2 

for y = f\(ct — x). When y =f 2 (ct + x) a similar result holds. 



(Problems 5.1, 5.2) 

If y is the simple harmonic displacement of an oscillator at position x and time t we 
would expect, from Chapter 1, to be able to express it in the form y = a sin {u>t — &), and in 
fact all of the waves we discuss in this book will be described by sine or cosine functions. 

The bracket (ct — x) in the expression _y = f(ct — x) has the dimensions of a length and, 
for the function to be a sine or cosine, its argument must have the dimensions of radians so 
that (ct — x) must be multiplied by a factor 2 tt/ A, where A is a length to be defined. 
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X 



Figure 5.4 Locus of oscillator displacements in a continuous medium as a wave passes over them 
travelling in the positive x-direction. The wavelength A is defined as the distance between any two 
osciLlators having a phase difference of 27rrad 



We can now write 



y — a sin (cot — </>) = a sin {ct — x) 

A 



as a solution to the wave equation if 2ttc/X — u> — 2-ia', where v is the oscillation 
frequency and <f> — 2 -kx/X. 

This means that if a wave, moving to the right, passes over the oscillators in a medium 
and a photograph is taken at time t = 0, the locus of the oscillator displacements (Fig- 
ure 5.4) will be given by the expression y — a sin (cut — <i>) = a sin 2 tt( ct — x)/\. If we now 
observe the motion of the oscillator at the position x — 0 it will be given by y = a sin cot. 

Any oscillator to its right at some position x will be set in motion at some later time by 
the wave moving to the right; this motion will be given by 



y = a sin (uit — f) 



2i r 



a sin 



{ct — x) 



having a phase lag of <p with respect to the oscillator at x = 0. This phase lag <f> = 2 -kx/X, 
so that if a = A the phase lag is 27r rad that is, equivalent to exactly one complete vibration 
of an oscillator. 

This defines A as the wavelength , the separation in space between any two oscillators 
with a phase difference of 27 t rad. The expression 2irc/X = ui — 2 tuj gives c = uX, where 
c, the wave or phase velocity, is the product of the frequency and the wavelength. Thus, 
X/c = l/v = t, the period of oscillation, showing that the wave travels one wavelength in 
this time. An observer at any point would be passed by v wavelengths per second, a 
distance per unit time equal to the velocity c of the wave. 

If the wave is moving to the left the sign of <b is changed because the oscillation at x 
begins before that at a = 0. Thus, the bracket 



{ct — a) denotes a wave moving to the right 
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and 



( ct + x ) gives a wave moving in the direction of negative x. 

There are several equivalent expressions for y = f(ct — x) which we list here as sine 
functions, although cosine functions are equally valid. 

They are: 



.V = 
.V = 
.V = 
.V = 



. 27 r 

a sin — 
A 

a sin 27 t 



{ct — x) 




a sincu^f — 
a sin (c ot — kx) 



where k = 2n/X = lo/c is called the wave number ; also y = ae 1 * 1 "^, the exponential 
representation of both sine and cosine. 

Each of the expressions above is a solution to the wave equation giving the displacement 
of an oscillator and its phase with respect to some reference oscillator. The changes of the 
displacements of the oscillators and the propagation of their phases are what we observe as 
wave motion. 

The wave or phase velocity is, of course, dx/dt, the rate at which the disturbance moves 
across the oscillators; the oscillator or particle velocity is the simple harmonic velocity 

dy/dt. 

Choosing any one of the expressions above for a right-going wave, e.g. 

y = a sin {cut — kx) 

we have 

dy 

— = tea cos (c ot — kx) 

and 

dy 

-4 - = —ka cos (t ot — kx) 
ox 

so that 

/ dx dy\ 

\ dt dx ) 

the product of the wave velocity 



dy lo dy dy 
dt k dx C dx 



The particle velocity dy/dt is therefore given as 

dx 



c = 



dt 



and the gradient of the wave profile preceded by a negative sign for a right-going wave 

y =f(ct~x) 
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Figure 5.5 The magnitude and direction of the particle velocity dy/dt = —c(dy/dx) at any point* 
is shown by an arrow in the right-going sine wave above 

In Figure 5.5 the arrows show the direction of the particle velocity at various points of 
the right-going wave. It is evident that the particle velocity increases in the same direction 
as the transverse force in the wave and we shall see in the next section that this force is 
given by 

— Tdy/dx 

where T is the tension in the string. 

(Problem 5.3) 

Characteristic Impedance of a String (the string as a forced 
oscillator) 

Any medium through which waves propagate will present an impedance to those waves. If 
the medium is lossless, and possesses no resistive or dissipation mechanism, this 
impedance will be determined by the two energy storing parameters, inertia and elasticity, 
and it will be real. The presence of a loss mechanism will introduce a complex term into 
the impedance. 

A string presents such an impedance to progressive waves and this is defined, because of 
the nature of the waves, as the transverse impedance 

„ transverse force F 



transverse velocity v 
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F 0 e mt 




F 0 e mt = -Ts\n9 

Figure 5.6 The string as a forced oscillator with a vertical force F 0 e lu,t driving it at one end 



The following analysis will emphasize the dual role of the string as a medium and as a 
forced oscillator. 

In Figure 5.6 we consider progressive waves on the string which are generated at one end 
by an oscillating force, F oe lrf , which is restricted to the direction transverse to the string 
and operates only in the plane of the paper. The tension in the string has a constant value, T, 
and at the end of the string the balance of forces shows that the applied force is equal and 
opposite to T sin 9 at all time, so that 



Fq e iu>t 



— T sin# 



—T tan# = —T 




where 9 is small. 

The displacement of the progressive waves may be represented exponentially by 

y = Ae‘^“^ 

where the amplitude A may be complex because of its phase relation with F. At the end of 
the string, where x = 0, 






-T 




= ikTAe i{u}t ~ k0) 

x=0 



giving 

A = — = — 
i kT iw 77 

and 



y 



Fo 

i u> 



e i(ut-kx) 



(since c = w/k). 
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The transverse velocity 



▼ = y = ^o(|)e i(wr - fa) 

where the velocity amplitude v = Fq/Z, gives a transverse impedance 

Z = — = pc (since T = pc 1 ) 
c 

or Characteristic Impedance of the string. 

Since the velocity c is determined by the inertia and the elasticity, the impedance is also 
governed by these properties. 

(We can see that the amplitude of displacement y = Fq/loZ, with the phase relationship 
— i with respect to the force, is in complete accord with our discussion in Chapter 3.) 

Reflection and Transmission of Waves on a String at a Boundary 

We have seen that a string presents a characteristic impedance pc to waves travelling along 
it, and we ask how the waves will respond to a sudden change of impedance; that is, of the 
value pc. We shall ask this question of all the waves we discuss, acoustic waves, voltage 
and current waves and electromagnetic waves, and we shall find a remarkably consistent 
pattern in their behaviour. 

We suppose that a string consists of two sections smoothly joined at a point x = 0 with a 
constant tension T along the whole string. The two sections have different linear densities 
p i and p 2, and therefore different wave velocities T/pi = c 1 and T j pi = c\. The specific 
impedances are p \ c \ and P 2 C 2 , respectively. 

An incident wave travelling along the string meets the discontinuity in impedance at the 
position x = 0 in Figure 5.7. At this position, x = 0, a part of the incident wave will be 
reflected and part of it will be transmitted into the region of impedance p2Ci. 

We shall denote the impedance p\C\ by Z] and the impedance P 2 C 2 by Zo. We write the 
displacement of the incident wave as y t —A je 1 ^ 1- ^, a wave of real (not complex) 



Transmitted wave 




Figure 5.7 Waves on a string of impedance p\C\ reflected and transmitted at the boundary x = 0 
where the string changes to impedance pzCz 
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amplitude A i travelling in the positive x-direction with velocity c i. The displacement of 
the reflected wave is y r = B \W^ ,+klX \ of amplitude B\ and travelling in the negative 
x-direction with velocity c i. 

The transmitted wave displacement is given by y, = Ai of amplitude A 2 and 

travelling in the positive x-direction with velocity c 2 . 

We wish to find the reflection and transmission amplitude coefficients; that is, the relative 
values of B 1 and A 2 with respect to A 1 . We find these via two boundary conditions which 
must be satisfied at the impedance discontinuity at x = 0. 

The boundary conditions which apply at x = 0 are: 

1 . A geometrical condition that the displacement is the same immediately to the left and 
right of x = 0 for all time, so that there is no discontinuity of displacement. 

2 . A dynamical condition that there is a continuity of the transverse force T(dy/dx ) at 
x = 0 , and therefore a continuous slope. This must hold, otherwise a finite difference in 
the force acts on an infinitesimally small mass of the string giving an infinite 
acceleration; this is not permitted. 

Condition ( 1 ) at x = 0 gives 

W + Vr = y t 



or 



^(uit-kix) B ^ e i(ut+k,x) _ A 2& i{ut-k 2 x) 



At x = 0 we may cancel the exponential terms giving 



A 1 +61 — A 2 



( 5 . 1 ) 



Condition ( 2 ) gives 



at x — 0 for all t, so that 



T d d 

T g x (yr+y ' ) = T W 



— k 1 TA 1 T k 1 TB \ — — k2TA2 



or 



T A T n T t 

— 10 — A\ T io — B\ — — ut — Ai 
c 1 Cl C 2 



after cancelling exponentials at x = 0 . But T/c 1 = P\C\ — Z\ and T/c 2 = P2C2 = Z2, 
so that 



- 5 ,)=Z 2 A 2 



( 5 . 2 ) 
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Equations (5.1) and (5.2) give the 



Reflection coefficient of amplitude , 



Bi 

Ai 



Zi~Z 2 
Z\ + z 2 



and the 



Transmission coefficient of amplitude , — = ! — 

A i Z\ + Z 2 

We see immediately that these coefficients are independent of u> and hold for waves of all 
frequencies; they are real and therefore free from phase changes other than that of 7r rad 
which will change the sign of a term. Moreover, these ratios depend entirely upon the ratios 
of the impedances. (See summary on p. 546). If Z 2 = oo, this is equivalent to x = 0 being a 
fixed end to the string because no transmitted wave exists. This gives B\/A \ = — 1 , so that 
the incident wave is completely reflected (as we expect) with a phase change of 7r (phase 
reversal) — conditions we shall find to be necessary for standing waves to exist. A group of 
waves having many component frequencies will retain its shape upon reflection at Z 2 = oo, 
but will suffer reversal (Figure 5.8). If Z 2 = 0, so that x = 0 is a free end of the string, then 
B \ / A \ = 1 and A 2 /A\ = 2. This explains the ‘flick’ at the end of a whip or free ended 
string when a wave reaches it. 



Reflection of pulse having many 

frequency components 



Incident 

Pulse 



Infinite 
Impedance 
pC = °° 



Reflected 
1 Pulse 




C 



Figure 5.8 A pufse of arbitrary shape is reflected at an infinite impedance with a phase change of 
7r rad, so that the reflected pulse is the inverted and reversed shape of the initial waveform. The pulse 
at reflection is divided in the figure into three sections A, B, and C. At the moment of observation 
section C has aLready been reflected and suffered inversion and reversal to become C'. The actual 
shape of the pulse observed at this instant is A being A + B — C' where B = C'. The displacement at 
the point of reflection must be zero. 
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(Problems 5.4, 5.5, 5.6) 

Reflection and Transmission of Energy 

Our interest in waves, however, is chiefly concerned with their function of transferring 
energy throughout a medium, and we shall now consider what happens to the energy in a 
wave when it meets a boundary between two media of different impedance values. 

If we consider each unit length, mass p, of the string as a simple harmonic oscillator of 
maximum amplitude A, we know that its total energy will be E = \ f>uj 2 A 2 , where uj is the 
wave frequency. 

The wave is travelling at a velocity c so that as each unit length of string takes up its 
oscillation with the passage of the wave the rate at which energy is being carried along the 
string is 

(energy x velocity) = \ pco 2 A 2 c 

Thus, the rate of energy arriving at the boundary x = 0 is the energy arriving with the 
incident wave; that is 



\ p\c\ui 2 A\ = jZiiu 2 Aj 

The rate at which energy leaves the boundary, via the reflected and transmitted waves, is 
\p\c\uj 2 B\ + \p 2 C 2 U 2 A\ = \Z\ui 2 B\ + \Z 1 J 1 A\ 
which, from the ratio B \ j A \ and A 2/ A 1 , 



= \u 2 A\ 



Z\(Z\ — Zi) + 4 Z^Zo 

(Zi +z 2 ) 2 



\ Z\ui 2 A\ 



Thus, energy is conserved, and all energy arriving at the boundary in the incident 
wave leaves the boundary in the reflected and transmitted waves. 



The Reflected and Transmitted Intensity Coefficients 

These are given by 

Reflected Energy Z\B\ /BA 2 fZ\—Z2 \ 2 

Incident Energy Z\A\ \Ai/ \Zi+Z2/ 

Transmitted Energy Zo/t \ WZ\Zi 

Incident Energy ZjAj (Z\+Zi ) 2 



We see that if Z\ = Z2 no energy is reflected and the impedances are said to be matched. 
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(Problems 5.7, 5.8) 

The Matching of Impedances 

Impedance matching represents a very important practical problem in the transfer of 
energy. Long distance cables carrying energy must be accurately matched at all joints to 
avoid wastage from energy reflection. The power transfer from any generator is a 
maximum when the load matches the generator impedance. A loudspeaker is matched to 
the impedance of the power output of an amplifier by choosing the correct turns ratio on the 
coupling transformer. This last example, the insertion of a coupling element between two 
mismatched impedances, is of fundamental importance with applications in many branches 
of engineering physics and optics. We shall illustrate it using waves on a string, but the 
results will be valid for all wave systems. 

We have seen that when a smooth joint exists between two strings of different 
impedances, energy will be reflected at the boundary. We are now going to see that the 
insertion of a particular length of another string between these two mismatched strings will 
allow us to eliminate energy reflection and match the impedances. 

In Figure 5.9 we require to match the impedances Z\ — p\C\ and Z 3 = /O3C3 by the 
smooth insertion of a string of length l and impedance Zj = pici- Our problem is to find 
the values of / and Z2. 




Figure 5.9 The impedances Z\ and Z 3 of two strings are matched by the insertion of a length / of a 
string of impedance Z 2 . The incident and reflected waves are shown for the boundaries x = 0 and 
x = t. The impedances are matched when Z\ = ZiZ 3 and / = A/4 in Z 2 , results which are true for 
waves in all media 
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The incident, reflected and transmitted displacements at the junctions x = 0 and x —1 are 
shown in Figure 5.9 and we seek to make the ratio 

Transmitted energy Z\A ) 

Incident energy Z\A\ 



equal to unity. 

The boundary conditions are that y and Tidy f dx) are continuous across the junctions 
x = 0 and x = l. 

Between Z i and Z 2 the continuity of y gives 

A J e i(ujt—k ix) _j_ g e i(u»+tu) _ A 2 Q^t-k 2 x) _|_ g^ e i(ut+k 2 x) 



or 

A \ + B \ = A 2 + B 2 (at x = 0) (5.3) 

Similarly the continuity of T(dy/dx ) at x = 0 gives 

T(—\k i A | + ik\B{) = T{— \k 2 A 2 + ik^B^) 

Dividing this equation by ui and remembering that Tik/uj) = T/c = pc = Z we have 

Z l (A l -B l )=Z 2 (A 2 -B 2 ) (5.4) 

Similarly at x = /, the continuity of y gives 

A 2 e- ik21 + B 2 e ikl1 =Ai (5.5) 

and the continuity of T(dy/dx ) gives 

Z 2 (A 2 - B 2 &' kl ‘) = Z 3 A 3 (5.6) 

From the four boundary equations (5.3), (5.4), (5.5) and (5.6) we require the ratio A 3 /A 1 . 
We use equations (5.3) and (5.4) to eliminate B \ and obtain A 1 in terms of A 2 and B 2 . We 
then use equations (5.5) and (5.6) to obtain both A 2 and B 2 in terms of A 3 . Equations (5.3) 
and (5.4) give 



or 



Zi(A 1 — A 2 — B 2 + A 1 ) = Z 2 (A 2 — B 2 ) 



A 1 



A 2 (n 2 + 1) + B 2 (r l2 — 1) 
2 r 12 



(5.7) 




where 
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Equations (5.5) and (5.6) give 



A 2 



r 23 + 1 
2r 2 3 



A 3 e ikl ' 



and 



where 



B 2 



r 23 ~ 1 
2 r 93 



A 3 e 



— 



r 23 



Z2 

z 3 



Equations (5.7) and (5.8) give 



where 



A, = — ^[(r 12 + l)(r 23 +l)e lW +(r 12 - l)(r 23 - l)e- iW ] 
4r i 2 r 23 

= P~ [(n 3 + l)(e iW + e- iW ) + (rn + r 2 3 )(^‘ ~ e“ iW )] 

4r i 3 
A 3 

= - — [(r i3 + 1) cos k 2 l + i(r 12 + r 23 ) sin/: 2 /] 

Zr !3 



Zi z 2 

''l 2 r 2 3 = 7^77- 
^2 ^3 




Hence 



f M\ 2 = 4r|j 

\Ai/ (r 13 + l ) 2 cos 2 k 2 I + (r 12 + r 23 ) 2 sin 2 k 2 l 



or 

transmitted energy Z 3 A 2 1 A 2 
incident energy Z 1 A 2 r 13 A j 

4/j3 

(r 13 + 1 ) 2 cos 2 k 2 l + (r l2 + r 23 ) 2 sin 2 k 2 l 
If we choose l = A 2 /4,cosA 2 ( = 0 and sinA 2 / = 1 we have 



Z 3 A 5 _ 4r i 3 

Zi A 2 (r 12 + r 23 ) 2 



when 



(5.8) 



r 12 = r 23 
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that is, when 

Z 1 Z 2 , 

— — — or Z 2 - VZ 1 Z 3 
■^2 ^3 

We see, therefore, that if the impedance of the coupling medium is the harmonic mean of 
the two impedances to be matched and the thickness of the coupling medium is 

— where A 2 = — 

4 ‘ k 2 

all the energy at frequency u> will be transmitted with zero reflection. 

The thickness of the dielectric coating of optical lenses which eliminates reflections 
as light passes from air into glass is one quarter of a wavelength. The ‘bloomed’ appearance 
arises because exact matching occurs at only one frequency. Transmission lines are matched 
to loads by inserting quarter wavelength stubs of lines with the appropriate impedance. 

(Problems 5.9, 5.10) 

Standing Waves on a String of Fixed Length 

We have already seen that a progressive wave is completely reflected at an infinite 
impedance with a 7 r phase change in amplitude. A string of fixed length l with both ends 
rigidly clamped presents an infinite impedance at each end; we now investigate the 
behaviour of waves on such a string. Let us consider the simplest case of a monochromatic 
wave of one frequency ui with an amplitude a travelling in the positive x-direction and an 
amplitude b travelling in the negative x-direction. The displacement on the string at any 
point would then be given by 

y = + be i{ut+kx} 

with the boundary condition that y = 0 at x = 0 and x = l at all times. 

The condition y — 0 at x = 0 gives 0 = (a + b) e lw * for all t, so that a = —b. This 
expresses physically the fact that a wave in either direction meeting the infinite impedance 
at either end is completely reflected with a 7r phase change in amplitude. This is a general 
result for all wave shapes and frequencies. 

Thus 



y = ae iuJt (e~ ikx - e ifa ) = (-2i)ae iwf sinfcc (5.9) 

an expression for y which satisfies the standing wave time independent form of the wave 
equation 

d 2 y/dx 2 + k 2 y = 0 

because (1 ( / c 2 ){d 2 y / dt 2 ) = (—u> 2 /c 2 )y — —k 2 y. The condition that y = 0 at x = I for all t 
requires 

. . (jjl i ol 

sin kl = sin — = 0 or — = nir 
c c 
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limiting the values of allowed frequencies to 

tmc 

Un= — 

or 

nc c 

Un = Yi = \, 

that is 

/ - n ^ n 
2 

giving 

. U! n X . tlTTX 

sin = sin — — 

c l 

These frequencies are the normal frequencies or modes of vibration we first met in 
Chapter 4. They are often called eigenfrequencies, particularly in wave mechanics. 

Such allowed frequencies define the length of the string as an exact number of half 
wavelengths, and Figure 5.10 shows the string displacement for the first four harmonics 
(n = 1,2, 3,4). The value for n = 1 is called the fundamental . 

As with the loaded string of Chapter 4, all normal modes may be present at the same 
time and the general displacement is the superposition of the displacements at each 
frequency. This is a more complicated problem which we discuss in Chapter 10 (Fourier 
Methods). 

For the moment we see that for each single harmonic n > 1 there will be a number of 
positions along the string which are always at rest. These points occur where 

. uj„x . nnx 
sin — — = sin — — = 0 
c l 

or 

~j~ = r7T (r = 0, 1,2,3,... n) 




Figure 5.10 The first four harmonics, n = 1, 2, 3, 4 of the standing waves aLLowed between the two 
fixed ends of a string 
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The values r = 0 and r = n give x = 0 and x = l, the ends of the string, but between the ends 
there are n — 1 positions equally spaced along the string in the nth harmonic where the 
displacement is always zero. These positions are called nodes or nodal points, being the 
positions of zero motion in a system of standing waves. Standing waves arise when a 
single mode is excited and the incident and reflected waves are superposed. If the amplitudes 
of these progressive waves are equal and opposite (resulting from complete reflection), 
nodal points will exist. Often however, the reflection is not quite complete and the waves in 
the opposite direction do not cancel each other to give complete nodal points. In this case 
we speak of a standing wave ratio which we shall discuss in the next section but one. 

Whenever nodal points exist, however, we know that the waves travelling in opposite 
directions are exactly equal in all respects so that the energy carried in one direction is 
exactly equal to that carried in the other. This means that the total energy flux; that is, the 
energy carried across unit area per second in a standing wave system, is zero. 

Returning to equation (5.9), we see that the complete expression for the displacement of 
the nth harmonic is given by 



y„ = 



2a(— i)(cosuV + i sinu;„r) sin 



LO„X 

c 



We can express this in the form 



u>„x 



, s uy yijv 

y n = (A„ cos io n t + B„ sin uj„t) sin 



where the amplitude of the nth mode is given by (A^ + B „) 1/2 = 2 a 



(5.10) 



(Problem 5.11) 



Energy of a Vibrating String 



A vibrating string possesses both kinetic and potential energy. The kinetic energy of an 
element of length dx and linear density p is given by ( p dry 2 ; the total kinetic energy is the 
integral of this along the length of the string. 

Thus 



E kin = h 



py 2 dx 



The potential energy is the work done by the tension T in extending an element dx to a new 
length d.s when the string is vibrating. 

Thus 



E pot — 



T(ds — dx) = 




dx 





dx 



if we neglect higher powers of dy/dx. 
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Now the change in the length of the element dx is ^(dy/dx ) 2 dr, and if the string is 
elastic the change in tension is proportional to the change in length so that, provided 
(dy/dx) in the wave is of the first order of small quantities, the change in tension is of the 
second order and T may be considered constant. 



Energy in Each Normal Mode of a Vibrating String 

The total displacement y in the string is the superposition of the displacements y n of the 
individual harmonics and we can find the energy in each harmonic by replacing y n for y in 
the results of the last section. Thus, the kinetic energy in the nth harmonic is 

E„ (kinetic) = \ 

and the potential energy is 

E n (potential) =\T 

Since we have already shown for standing waves that 



pyl d* 



dy„ 

dx 



dx 



then 



and 



Thus 



y„ 



(. A n cos io n t + B n sinu;„f) sin 



<jJ„X 

c 



y„ = 



{—A n oj n sint o n t + B n u>„ cost o n t) sin 



U!„X 

c 



dy n , L 0 n x 

— — = — [A n cosiu n t + B„ sint o„t) cos 

ox c c 



£’„(kinetic) = ^pu>l[—A„ sintu„r + B n costu„t]' 



sm 



2 tU n X 

dx 



and 



E„ (potential) = \ T [A n cost o n t + B„ sint o n t \ ' 



cos 



2 t O n X 

— dx 



Remembering that T — pc 2 we have 

£„ (kinetic + potential) = \plcu 2 (A 2 + B 2 ) 
= \muHAl + B% 



where m is the mass of the string and (A 2 + B 2 ) is the square of the maximum 
displacement (amplitude) of the mode. To find the exact value of the total energy E„ of the 
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mode we would need to know the precise value of A„ and B„ and we shall evaluate these in 
Chapter 10 on Fourier Methods. The total energy of the vibrating string is, of course, the 
sum of all the E„’s of the normal modes. 

(Problem 5.12) 



Standing Wave Ratio 

When a wave is completely reflected the superposition of the incident and reflected 
amplitudes will give nodal points (zero amplitude) where the incident and reflected 
amplitudes cancel each other, and points of maximum displacement equal to twice the 
incident amplitude where they reinforce. 

If a progressive wave system is partially reflected from a boundary let the amplitude 
reflection coefficient B\/A \ of the earlier section be written as r, where r < 1 . 

The maximum amplitude at reinforcement is then A \ + B i ; the minimum amplitude is 
given by A i —B In this case the ratio of maximum to minimum amplitudes in the 
standing wave system is called the 

A i + B i 1 + r 



Standing Wave Ratio — 



A , - B , 



1 - r 



where r — B \/A\. 

Measuring the values of the maximum and minimum amplitudes gives the value of the 
reflection coefficient for 



r — B i/Ai = 



SWR — 1 



SWR+ 1 



where SWR refers to the Standing Wave Ratio. 



(Problem 5.13) 



Wave Groups and Group Velocity 

Our discussion so far has been limited to monochromatic waves — waves of a single 
frequency and wavelength. It is much more common for waves to occur as a mixture of 
a number or group of component frequencies; white light, for instance, is composed of 
a continuous visible wavelength spectrum extending from about 3000 A in the blue to 
7000 A in the red. Examining the behaviour of such a group leads to the third kind of 
velocity mentioned at the beginning of this chapter; that is, the group velocity. 

Superposition of Two Waves of Almost Equal Frequencies 

We begin by considering a group which consists of two components of equal amplitude a 
but frequencies uj \ and u >2 which differ by a small amount. 

Their separate displacements are given by 



y i = a cos (u \ t — k \x) 




Wave Groups and Group Velocity 



129 



and 

y 2 = a cos (uiot — kjx) 

Superposition of amplitude and phase gives 



y~yi~\~y 2 = 2a cos 



(0J1 - 0Ji)t (£1 - k 2 )x 




{UJ\ + IX >2 )t 


{k 1 + k 2 )x 


2 2 


tub 


2 


2 



a wave system with a frequency (uq + uj 2) /2 which is very close to the frequency of either 
component but with a maximum amplitude of 2a, modulated in space and time by a very 
slowly varying envelope of frequency (ui \ — 0 ^ 2 ) /2 and wave number (k \ — k 2 )j 2 . 

This system is shown in Figure 5.11 and shows, of course a behaviour similar to that of 
the equivalent coupled oscillators in Chapter 4. The velocity of the new wave is 
(uq — ui2)/{k\ — Aq) which, if the phase velocities uq/fci — uq/&2 = c, gives 

wi - l o 2 (^1 - k 2 ) 

= c = c 

k 1 - k 2 k 1 - k 2 



so that the component frequencies and their superposition, or group will travel with the 
same velocity, the profile of their combination in Figure 5.11 remaining constant. 

If the waves are sound waves the intensity is a maximum whenever the amplitude is a 
maximum of 2a; this occurs twice for every period of the modulating frequency; that is, at 
a frequency v\ — v 2 - 




Oscillation of 
frequency ®i + <*>2 
2 



Figure 5.11 The superposition of two waves of slightly different frequency u>i and u! 2 forms a 
group. The faster osciLlation occurs at the average frequency of the two components (uj 1 + tu 2 )/2 
and the slowLy varying group enveLope has a frequency (oj 1 — w 2 )/ 2, half the frequency difference 
between the components 
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The beats of maximum intensity fluctuations thus have a frequency equal to the 
difference — v 2 of the components. In the example here where the components have 
equal amplitudes a, superposition will produce an amplitude which varies between 2 a and 
0; this is called complete or 100% modulation. 

More generally an amplitude modulated wave may be represented by 

y — A cos (cot — kx ) 



where the modulated amplitude 

A = a + bcosco't 



This gives 

b 

y — a cos (cot — kx) + - {[cos (co + io')t — kx] + [cos (co — oo’)t — /r.r]} 

so that here amplitude modulation has introduced two new frequencies co ± u>', known as 
combination tones or sidebands. Amplitude modulation of a carrier frequency is a common 
form of radio transmission, but its generation of sidebands has led to the crowding of radio 
frequencies and interference between stations. 



Wave Groups and Group Velocity 

Suppose now that the two frequency components of the last section have different phase 
velocities so that co\/k\ =/=■ coi/ki- The velocity of the maximum amplitude of the group; 
that is, the group velocity 

U0\ — U>2 AcU 

k 1 — k 2 AA' 

is now different from each of these velocities; the superposition of the two waves will no 
longer remain constant and the group profile will change with time. 

A medium in which the phase velocity is frequency dependent ( co/k not constant) is 
known as a dispersive medium and a dispersion relation expresses the variation of co as a 
function of k. If a group contains a number of components of frequencies which are nearly 
equal the original expression for the group velocity is written 

A co dt o 

A k d k 

The group velocity is that of the maximum amplitude of the group so that it is the velocity 
with which the energy in the group is transmitted. Since co = kv, where v is the phase 
velocity, the group velocity 



do; d 



d k 
— v — A 



dk 

dv 

dA 



v + k 



dv 
d k 
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Figure 5.12 Curves illustrating dispersion relations: (a) a straight line representing a non- 
dispersive medium, v = v g ; (b) a normal dispersion relation where the gradient v = tv/k > 
v g = duj/dk; (c) an anomalous dispersion relation where v < v g 



where k = 2 tt/\. Usually du/dA is positive, so that v g < v. This is called normal 
dispersion, but anomalous dispersion can arise when du/d A is negative, so that v g > v. 

We shall see when we discuss electromagnetic waves that an electrical conductor is 
anomalously dispersive to these waves whilst a dielectric is normally dispersive except at the 
natural resonant frequencies of its atoms. In the chapter on forced oscillations we saw that 
the wave then acted as a driving force upon the atomic oscillators and that strong 
absorption of the wave energy was represented by the dissipation fraction of the oscillator 
impedance, whilst the anomalous dispersion curve followed the value of the reactive part of 
the impedance. 

The three curves of Figure 5.12 represent 

• A non-dispersive medium where uj/k is constant, so that v g = v, for instance free space 
behaviour towards light waves. 

• A normal dispersion relation v g < v. 

• An anomalous dispersion relation v g > v. 

Example. The electric vector of an electromagnetic wave propagates in a dielectric with a 
velocity v = (/is) 12 where p is the permeability and e is the permittivity. In free space 
the velocity is that of light, c = (po£o) _1 ^ 2 . The refractive index 
n = c/v = yj pe/ pq£q = sjp r e r where p r — p/po and e r = e/e 0 - For many substances 
p r is constant and ~ 1, but e r is frequency dependent, so that v depends on A. 

The group velocity 
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Refractive 




Figure 5.13 Anomalous dispersion showing the behaviour of the refractive index n = sJT t versus to 
and A, where cu 0 is a resonant frequency of the atoms of the medium. The absorption in such a region 
is also shown by the dotted line 



so that v g > v (anomalous dispersion) when de,/dX is +ve. Figure 5.13 shows the 
behaviour of the refractive index n = y/e~ r versus u>, the frequency, and A, the wavelength, 
in the region of anomalous dispersion associated with a resonant frequency. The dotted 
curve shows the energy absorption (compare this with Figure 3.9). 

(Problems 5.14, 5.15, 5.16, 5.17, 5.18, 5.19) 



Wave Group of Many Components. The Bandwidth Theorem 

We have so far considered wave groups having only two frequency components. We may 
easily extend this to the case of a group of many frequency components, each of amplitude 
a, lying within the narrow frequency range Acu. 

We have already covered the essential physics of this problem on p. 20, where we found 
the sum of the series 



n— 1 

* = E a cos ( Lot + nS) 
o 

where 6 was the constant phase difference between successive components. Flere we are 
concerned with the constant phase difference (6io)t which results from a constant frequency 
difference 8u> between successive components. The spectrum or range of frequencies of this 
group is shown in Figure 5.14a and we wish to follow its behaviour with time. 

We seek the amplitude which results from the superposition of the frequency 
components and write it 

R = acosuqt + a cos (cu i + 8uS)t + a cos (co \ + 2 8uS)t + ■ ■ ■ 

+ acos [uj i + (n — l)(&u)]t 
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Figure 5.14 A rectangular wave band of width A 10 having n frequency components of amplitude a 
with a common frequency difference Slo. (b) Representation of the frequency band on a time axis is a 
cosine curve at the average frequency 10 , amplitude modulated by a sin a/a curve where 
a = Alo ■ t/2. After a time t = 2 tt/ Alo the superposition of the components gives a zero amplitude 




The result is given on p. 21 by 

sin \n(8u))tl 21 

R = a . cos cot 

sm [{6uS)t/ 2] 

where the average frequency in the group or band is 

Co = u)\ +\{n — l)(&u) 

Now n(6u) = Ac o, the bandwidth, so the behaviour of the resultant R with time may be 
written 

sin (Alo ■ t/2) _ sin (Acu ■ t/2) 

R(t) = a — ; — - cos Lot — na cos cut 

w sm (Alo ■ t/n2) Alo -t/2 



when n is large, 
or 

, sin a 

R(t) = A cos cot 

a 
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where A = na and a = Aw • t/2 is half the phase difference between the first and last 
components at time t. 

This expression gives us the time behaviour of the band and is displayed on a time axis in 
Figure 5.14b. We see that the amplitude R(t) is given by the cosine curve of the average 
frequency m modified by the A sin a/ a term. 

At t = 0, sin a/ a — > 1 and all the components superpose with zero phase difference to 
give the maximum amplitude R(t) — A = na. After some time interval At when 

Am At 

a = = 7r 

2 

the phases between the frequency components are such that the resulting amplitude R(t) is 
zero. 

The time At which is a measure of the width of the central pulse of Figure 5.14b is 
therefore given by 

Am At 



or An At = 1 where A m = 2 ttAv. 

The true width of the base of the central pulse is 2At but the interval At is taken as an 
arbitrary measure of time, centred about t — 0, during which the amplitude R(t) remains 
significantly large (> A/2). With this arbitrary definition the exact expression 

An At = 1 



becomes the approximation 



An At & \ or (Am At « 2-7r) 

and this approximation is known as the Bandwidth Theorem. 

It states that the components of a band of width A m in the frequency range will 
superpose to produce a significant amplitude R(t) only for a time At before the band 
decays from random phase differences. The greater the range A m the shorter the period At. 

Alternatively, the theorem states that a single pulse of time duration At is the result of 
the superposition of frequency components over the range Am; the shorter the period At of 
the pulse the wider the range A m of the frequencies required to represent it. 

When Am is zero we have a single frequency, the monochromatic wave which is 
therefore required (in theory) to have an infinitely long time span. 

We have chosen to express our wave group in the two parameters of frequency and time 
(having a product of zero dimensions), but we may just as easily work in the other pair of 
parameters wave number k and distance x. 

Replacing m by k and t by x would define the length of the wave group as Ax in terms of 
the range of component wavelengths A(l/A). 

The Bandwidth Theorem then becomes 



Ax A k k 2n 




Transverse Waves in a Periodic Structure 



135 



or 

AxA(l/A) ~ 1 i.e. Ai« A 2 /AA 

Note again that a monochromatic wave with A k = 0 requires Ax — » oo; that is, an 
infinitely long wavetrain. 

In the wave group we have just considered the problem has been simplified by assuming 
all frequency components to have the same amplitude a. When this is not the case, the 
different values a(uj) are treated by Fourier methods as we shall see in Chapter 10. 

We shall meet the ideas of this section several times in the course of this text, noting 
particularly that in modern physics the Bandwidth Theorem becomes Heisenberg’s 
Uncertainty Principle. 

(Problem 5.20) 



Transverse Waves in a Periodic Structure 



At the end of the chapter on coupled oscillations we discussed the normal transverse 
vibrations of n equal masses of separation a along a light string of length (n + 1 )a under a 
tension T with both ends fixed. The equation of motion of the /111 particle was found to be 

T 

my r = - OV+i + Jr-i ~ 2 y r ) 



and for n masses the frequencies of the normal modes of vibration were given by 






2 T 



ma 



1 — cos 






(4.15) 



where j — 1,2,3,..., n. When the separation a becomes infinitesimally small (= 6x, say) 
the term in the equation of motion 

-(yr+i +y r - i - 2 y r ) -> T-(v r +i +y r - 1 - 2 y r ) 

a ox 

= (y r +i -y r ) _ {yr-yr - Q = _ f #y\ = f 'dV\ ^ 

6x 6x \dx) r+ 1/2 \dx) r _ l/2 \dx 2 J r 



so that the equation of motion becomes 



d 2 y T d 2 y 
dt 2 p dx 2 ’ 



the wave equation, where p — m/8x, the linear density and 

y oc e i(w? “ fa) 

We are now going to consider the propagation of transverse waves along a linear array of 
atoms, mass m, in a crystal lattice where the tension T now represents the elastic force 
between the atoms (so that T ja is the stiffness) and a , the separation between the atoms, is 
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about 1 A or 10 _1(> m. When the clamped ends of the string are replaced by the ends of 
the crystal we can express the displacement of the rth particle due to the transverse 
waves as 



y r =A r e i(w, - fa) =A l .e i( " , - fao) , 

since x = ra. The equation of motion then becomes 

—u) 2 m = — (e ika + e~ ika — 2) 

= -(e ifa,/ 2 - e~ ika / 2 ) 2 = - —sin 2 — 
a a 2 

giving the permitted frequencies 



9 AT . , ka 
uj = — sin — 
ma 2 



(5.11) 



This expression for a; 2 is equivalent to our earlier value at the end of Chapter 4: 






2 T 



ma 



1 — cos 



AT . 2 ,/7T 

= — sin 



77T 

n + \J ma 2 (n + 1) 



(4.15) 



if 



ka jii 
~2 ~ 2 (n + 1) 

where j — 1, 2, 3 

But (n + l)n = /, the length of the string or crystal, and we have seen that wavelengths 
A are allowed where p\/2 — l — (n + l)n. 

Thus 

ka 2 tt a ira jair j n a 
2 A2A 2 (n + 1 )a p A 

if j = p. When j = p, a unit change in j corresponds to a change from one allowed number 
of half wavelengths to the next so that the minimum wavelength is A = 2a, giving a 
maximum frequency iu 2 = AT /ma. Thus, both expressions may be considered equivalent. 

When A = 2a, sin ka/ 2 — 1 because ka = n, and neighbouring atoms are exactly n rad 
out of phase because 



y r + 1 

The highest frequency is thus associated with maximum coupling, as we expect. 

If in equation (5.11) we plot |sinfar/2| against k (Figure 5.15) we find that when ka 
is increased beyond 7r the phase relationship is the same as for a negative value of 
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Figure 5.15 |siriy | versus k from equation (5.11) shows the repetition of values beyond the region 
< k < f; this region defines a Brillouin zone 



ka beyond — 7r. It is, therefore, sufficient to restrict the values of k to the region 

— 7T , 7T 

— <k<- 
a a 

which is known as the first Brillouin zone. We shall use this concept in the section on 
electron waves in solids in Chapter 13. 

For long wavelengths or low values of the wave number k, sin ka/2 — > ka / 2 so that 

o 4 Tk 2 a 2 

LO- = — 

ma 4 

and the velocity of the wave is given by 

2 lo 2 Ta T 

k 2 m p 



as before, where p = m/a. 

In general the phase velocity is given by 




sin ka/2 
ka/2 



(5.12) 



a dispersion relation which is shown in Figure 5.16. Only at very short wavelengths does 
the atomic spacing of the crystal structure affect the wave propagation, and here the limiting 
or maximum value of the wave number k m = n/a ~ 10 10 m 

The elastic force constant T /a for a crystal is about 15 Nm -1 ; a typical ‘reduced’ atomic 
mass is about 60 x 1 0 27 kg. These values give a maximum frequency 



AT _ 60 

ma 60 x 10~ 27 



10 27 rads -1 



that is, a frequency v k, 5 x 10 12 Hz. 
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Figure 5.16 The dispersion relation ui(k) versus k for waves travelling along a linear one- 
dimensional array of atoms in a periodic structure 

(Note that the value of T /a used here for the crystal is a factor of 8 lower than that found 
in Problem 4.4 for a single molecule. This is due to the interaction between neighbouring 
ions and the change in their equilibrium separation.) 

This frequency is in the infrared region of the electromagnetic spectrum. We shall see in 
a later chapter that electromagnetic waves of frequency u> have a transverse electric held 
vector E = Eq e lwr , where Eq is the maximum amplitude, so that charged atoms or ions in a 
crystal lattice could respond as forced oscillators to radiation falling upon the crystal, 
which would absorb any radiation at the resonant frequency of its oscillating atoms. 

Linear Array of Two Kinds of Atoms in an Ionic Crystal 

We continue the discussion of this problem using a one dimensional line which contains 
two kinds of atoms with separation a as before, those atoms of mass M occupying the odd 
numbered positions, 2 r — 1, 2r + 1, etc. and those of mass m occupying the even numbered 
positions, 2 r, 2 r + 2, etc. The equations of motion for each type are 

T 

myir = -{yir+l+ yir-l ~ 2v 2 r) 

and 

T 

My 2 r+l =-{yir+2 + V2r ~ 2y 2 r+\) 

with solutions 

y 2r = A m e i ( ut ~ 2rka '> 

y 2r+ i ^Me i( “'- (H1W 

where A m and A M are the amplitudes of the respective masses. 

The equations of motion thus become 

9 . TAm 2TA m 

-ui mA m = (e + e ) 

a a 






Figure 5.17 Dispersion relations for the two modes of transverse oscillation in a crystal structure 
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Figure 5.18 The displacements of the different atomic species in the two modes of transverse 
osciLlations in a crystaL structure (a) the optical mode, and (b) the acoustic mode 



The upper curve is called the ‘optical’ branch and the lower curve is known as the 
‘acoustical’ branch. The motions of the two types of atom for each branch are shown in 
Figure 5.18. 

In the optical branch for long wavelengths and small k. A m j A m = —M/m, and the atoms 
vibrate against each other, so that the centre of mass of the unit cell in the crystal remains 
fixed. This motion can be generated by the action of an electromagnetic wave when 
alternate atoms are ions of opposite charge; hence the name ‘optical branch’. In the 
acoustic branch, long wavelengths and small k give A m — A M , and the atoms and their 
centre of mass move together (as in longitudinal sound waves). We shall see in the next 
chapter that the atoms may also vibrate in a longitudinal wave. 

The transverse waves we have just discussed are polarized in one plane; they may also 
vibrate in a plane perpendicular to the plane considered here. The vibrational energy of 
these two transverse waves, together with that of the longitudinal wave to be discussed in 
the next chapter, form the basis of the theory of the specific heats of solids, a topic to which 
we shall return in Chapter 9. 



Absorption of Infrared Radiation by Ionic Crystals 

Radiation of frequency 3 x 10 12 Hz. gives an infrared wavelength of 100 pm (10 ~ 4 m) and 
a wave number k = 2tt/A ~ 6.10 4 m l . We found the cut-off frequency in the crystal 
lattice to give a wave number k m « 10 10 m -1 , so that the k value of infrared radiation is a 
negligible quantity relative to k m and may be taken as zero. When the ions of opposite 
charge ±e move under the influence of the electric field vector E = E oe“ of 
electromagnetic radiation, the equations of motion (with k = 0) become 

2 j 

lu mA m — (Am A n /j eE@ 
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and 



— UJ MA m — 



-2 T 



{A m — A m ) + eEo 



which may be solved to give 



Am 



eE o 

M{u 2 - ut 2 ) 



and A m 



-e E o 

m — ut 2 ) 



where 



2 _ 2T ( 1 1 

the low k limit of the optical branch. 

Thus, when ut = uiq infrared radiation is strongly absorbed by ionic crystals and the ion 
amplitudes A M and A m increase. Experimentally, sodium chloride is found to absorb 
strongly at A = 61 pm; potassium chloride has an absorption maximum at A = 71 pm. 

(Problem 5.21) 

Doppler Effect 

In the absence of dispersion the velocity of waves sent out by a moving source is constant 
but the wavelength and frequency noted by a stationary observer are altered. 

In Figure 5.19 a stationary source S emits a signal of frequency v and wavelength A for a 
period t so the distance to a stationary observer O is vXt. If the source S' moves towards O 
at a velocity u during the period t then O registers a new frequency v'. 

We see that 

vXt = ut + vX!t 



vtX 



S 




o 



vtX' 



> 



ut 




O 



Figure 5.19 If waves from a stationary source 5 are received by a stationary observer 0 at frequency 
v and wavelength A the frequency is observed as u' and the wavelength as A' at 0 if the source S' 
moves during transmission. This is the Doppler effect 
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which, for 



c = v A = u'X' 



gives 



Hence 



c — u 

This observed change of frequency is called the Doppler Effect. 

Suppose that the source S is now stationary but that an observer O' moves with a velocity 
v away from S. If we superimpose a velocity —v on observer, source and waves, we bring 
the observer to rest; the source now has a velocity — v and waves a velocity of c — v. 
Using these values in the expression for v' gives a new observed frequency 




v „ _ ~ v) 

c 



(Problems 5.22, 5.23, 5.24, 5.25, 5.26, 5.27, 5.28, 5.29, 5.30, 5.31) 



Problem 5.1 

Show that y = fi(ct + x) is a solution of the wave equation 

d 2 y 1 d 2 y 

dx 2 c 2 dt 2 



Problem 5.2 

Show that the wave profile; that is, 

y =f\{ct~x) 

remains unchanged with time when c is the wave velocity. To do this consider the expression for y at 
a time t + At where At = Ax/c. 

Repeat the problem for y =f 2 (ct + x). 

Problem 5.3 

Show that 

dy dy 
dt ” I_C dx 

for a left-going wave drawing a diagram to show the particle velocities as in Figure 5.5 (note that c is 
a magnitude and does not change sign). 

Problem 5.4 

A triangular shaped pulse of length l is reflected at the fixed end of the string on which it travels 
(Z 2 = oo). Sketch the shape of the pulse (see Figure 5.8) after a length (a) 1/4 (b) 1/2 (c) 31/4 and 
(d) l of the pulse has been reflected. 
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Problem 5.5 

A point mass M is concentrated at a point on a string of characteristic impedance pc. A transverse 
wave of frequency uj moves in the positive x direction and is partially reflected and transmitted at the 
mass. The boundary conditions are that the string displacements just to the left and right of the mass 
are equal (y,- + y r = } v) and that the difference between the transverse forces just to the left and 
right of the mass equal the mass times its acceleration. If A i, Si and A 2 are respectively the incident, 
reflected and transmitted wave amplitudes show that 

B 1 — i q , A 2 1 

— = and — = 

A 1 1 + 117 A 1 1 + 117 

where q = uiM/lpc and i 2 = — 1. 

Problem 5.6 

In problem 5.5, writing q = tan 9, show that A 2 lags A 1 by 9 and that B \ lags A 1 by (tt/2 + 9) for 
0 <9 < tt/2. 

Show also that the reflected and transmitted energy coefficients are represented by sin 2 9 and 
cos 2 9, respectively. 

Problem 5.7 

If the wave on the string in Figure 5.6 propagates with a displacement 

y = a sin (uit — kx ) 

Show that the average rate of working by the force (average value of transverse force times 
transverse velocity) equals the rate of energy transfer along the string. 

Problem 5.8 

A transverse harmonic force of peak value 0.3 N and frequency 5 Hz initiates waves of amplitude 
0.1 m at one end of a very long string of linear density 0.01 kg/m. Show that the rate of energy 
transfer along the string is 37r/20 W and that the wave velocity is 30/-7rms _1 . 

Problem 5.9 

In the figure, media of impedances Z 1 and Z3 are separated by a 
medium of intermediate impedance Z2 and thickness A/4 mea- 
sured in this medium. A normally incident wave in the first 
medium has unit amplitude and the reflection and transmission 
coefficients for multiple reflections are shown. Show that the total 
reflected amplitude in medium 1 which is 

R + tTR'{\ + rR' + r 2 R' 2 ...) 



is zero at R = R' and show that this defines the condition 



Z\ = Z1Z3 

(Note that for zero total reflection in medium 1, the first reflection 
R is cancelled by the sum of all subsequent reflections.) 
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Problem 5.10 

The relation between the impedance Z and the refractive index n of a dielectric is given by Z = 1 /n. 
Light travelling in free space enters a glass lens which has a refractive index of 1.5 for a free space 
wavelength of 5.5 x 10 -7 m. Show that reflections at this wavelength are avoided by a coating of 
refractive index 1.22 and thickness 1.12 x 10~ 7 m. 



Problem 5.11 

Prove that the displacement v„ of the standing wave expression in equation (5.10) satisfies the time 
independent form of the wave equation 



d 2 y 
dx 2 



+ k 2 y = 0. 



Problem 5.12 

The total energy E n of a normal mode may be found by an alternative method. Each section dv of the 
string is a simple harmonic oscillator with total energy equal to the maximum kinetic energy of 
oscillation 



k -e- max = \pAx{yl) max = 5 P' n (? I ) max 



Now the value of (y 2 ) max at a point x on the string is given by 



= sin 2 ^ 



Show that the sum of the energies of the oscillators along the string; that is, the integral 



gives the expected result. 



5 P u l 



Cf^maxd* 



Problem 5.13 

The displacement of a wave on a string which is fixed at both ends is given by 

y(x, t) = A cos (u)t — kx) + rA cos (u>t + kx) 

where r is the coefficient of amplitude reflection. Show that this may be expressed as the 
superposition of standing waves 

y(x, t) = A(1 + r) cos cot cos kx + A(1 — r) sin cot sin kx. 



Problem 5.14 

A wave group consists of two wavelengths A and A + A A where A A/ A is very small. 

Show that the number of wavelengths A contained between two successive zeros of the modulating 
envelope is ~ A/AA. 

Problem 5.15 

The phase velocity v of transverse waves in a crystal of atomic separation a is given by 

sin (kci/ 2)\ 

(ka/2) ) 



V — c 
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where k is the wave number and c is constant. Show that the value of the group velocity is 

ka 

c cos — 

2 

What is the limiting value of the group velocity for long wavelengths? 

Problem 5.16 

The dielectric constant of a gas at a wavelength A is given by 

c 2 B 9 

£ r =^ = A + —^ — DX~ 
v l X z 

where A, B and D are constants, c is the velocity of light in free space and v is its phase velocity. If 
the group velocity is V g show that 



V g e r = v(A - 2DX 2 ) 



Problem 5.17 

Problem 3.10 shows that the relative permittivity of an ionized gas is given by 




where v is the phase velocity, c is the velocity of light and io e is the constant value of the 
electron plasma frequency. Show that this yields the dispersion relation u> 2 = u> 2 + c 2 k 2 , 
and that as u — > u e the phase velocity exceeds that of light, c, but that the group velocity 
(the velocity of energy transmission) is always less than c. 

Problem 5.18 

The electron plasma frequency of Problem 5.17 is given by 



e m e £ o ’ 

Show that for an electron number density n e ~ 10 2 °(10~ 5 of an atmosphere), electromagnetic waves 
must have wavelengths A < 3 x 10 _3 m (in the microwave region) to propagate. These are typical 
wavelengths for probing thermonuclear plasmas at high temperatures. 

s o = 8.8 x 10 -12 Fm4 
m e = 9.1 x 10~ 31 kg 
e = 1.6 x ltr 19 C 



Problem 5.19 

In relativistic wave mechanics the dispersion relation for an electron of velocity v = Tik/m is given 
by lj 2 /c 2 = k 2 + m 2 c 2 /ft 2 , where c is the velocity of light, m is the electron mass (considered 
constant at a given velocity) ft = Ii/2tt and ft is Planck’s constant. Show that the product of the group 
and particle velocities is c 2 . 
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Problem 5.20 

The figure shows a pulse of length At given by y = Acostuo?- 
Show that the frequency representation 

y(c S) = a cos w if + a cos (wj + 6u>)t • ■ • + a cos [aq + (n — l)(&u)]r 

is centred on the average frequency ui o and that the range of frequencies making significant 
contributions to the pulse satisfy the criterion 

Au) At w 2n 

Repeat this process for a pulse of length Ax with y = A cos kox to show that in k space the pulse is 
centred at k o with the significant range of wave numbers A k satisfying the criterion Ax Ak ~ 27r. 




Problem 5.21 

The elastic force constant for an ionic crystal is ~ 15 N m _1 . Show that the experimental values for 
the frequencies of infrared absorption quoted at the end of this chapter for NaCl and KC1 are in 
reasonable agreement with calculated values. 

1 a.m.u. = 1.66 x 10~ 27 kg 
Na mass = 23 a.m.u. 

K mass = 39 a.m.u. 

Cl mass = 35 a.m.u. 



Problem 5.22 

Show that, in the Doppler effect, the change of frequency noted by a stationary observer O as a 
moving source S' passes him is given by 

2 i/cu 



where c = v\, the signal velocity and u is the velocity of S' . 

Problem 5.23 

Suppose, in the Doppler effect, that a source S' and an observer O' move in the same direction with 
velocities u and v, respectively. Bring the observer to rest by superimposing a velocity — v on the 
system to show that O' now registers a frequency 

,// = ~ v) 

(c — u) 



Problem 5.24 

Light from a star of wavelength 6 x 10 “ 7 m is found to be shifted 10 ~ n m towards the red when 
compared with the same wavelength from a laboratory source. If the velocity of light is 
3 x 10 8 ms -1 show that the earth and the star are separating at a velocity of 5 Km s -1 . 
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Problem 5.25 

An aircraft flying on a level course transmits a signal of 3 x 10 9 Hz which is reflected from a distant 
point ahead on the flight path and received by the aircraft with a frequency difference of 15 kHz. 
What is the aircraft speed? 

Problem 5.26 

Light from hot sodium atoms is centred about a wavelength of 6 x 10 ~ 7 m but spreads 2 x 10 -12 m 
on either side of this wavelength due to the Doppler effect as radiating atoms move towards and 
away from the observer. Calculate the thermal velocity of the atoms to show that the gas temperature 
is ~ 900 K. 



Problem 5.27 

Show that in the Doppler effect when the source and observer are not moving in the same direction 
that the frequencies 



v 




v(c — v ) 
c 



and 




are valid if u and v are not the actual velocities but the components of these velocities along the 
direction in which the waves reach the observer. 



Problem 5.28 

In extending the Doppler principle consider the accompanying figure where O is a stationary 
observer at the origin of the coordinate system 0( x, t) and O' is an observer situated at the origin of 
the system O' (x',t') which moves with a constant velocity v in the x direction relative to the system 
O. When O and O' are coincident at t = t' = 0 a light source sends waves in the x direction with 
constant velocity c. These waves obey the relation 





0 = x 1 — c 2 f 2 (seen by O) = x' 1 — cV 2 (seen by O'). 


(1) 


Since there is 


only one relative velocity v, the transformation 




and 


x' = k(x — vt) 


(2) 




x = k'(x' + vt') 


(3) 



must also hold. Use (2) and (3) to eliminate x' and t' from (1) and show that this identity is satisfied 
only by k = k' = 1/(1 — (3 2 ) *^ 2 , where fJ = v/c. (Hint — in the identity of equation (1) equate 
coefficients of the variables to zero.). 



0 (xt) 



0' (x'P) 



vt 



0 



v 



0 ' 
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This is the Lorentz transformation in the theory of relativity giving 



, (x — vt) x' + vt' 

x - (1 -p 2 ) l/2 ' x ~{l-p 2 ) l/2 
, = (t - ( v/c 2 )x ) = (t' + (v/c 2 )x') 

(1 -I3 2 ) l/2 ’ (1 - /3 2 ) 1/2 



Problem 5.29 

Show that the interval At = t 2 — t \ seen by O in Problem 5.28 is seen as At' = kAt by O' and that 
the length / = xi — X\ seen by O is seen by O' as V = l/k. 



Problem 5.30 

Show that two simultaneous events at X 2 and x i (f 2 = f 1 ) seen by O in the previous problems are not 
simultaneous when seen by O' (that is, t\ t' 2 ). 

Problem 5.31 

Show that the order of events seen by 0(t 2 > t\) of the previous problems will not be reversed 
when seen by O' (that is, t' 2 > t[) as long as the velocity of light c is the greatest velocity 
attainable. 



Summary of Important Rules 



d 2 y 1 d 2 y 

Wave Equation 



Wave (phase) velocity = c — 



to 



k = wave number = — 
A 



dx 

dt 



where the wavelength A defines separation between two oscillations with phase difference 
of 2-7r rad. 



Particle velocity 



dy 

dt 




Displacement y = kx> . 

where a is wave amplitude. 



Characteristic Impedance of a String 

transverse force ^dy t dy 

transverse velocity dx / dt ^ 
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Reflection and Transmission Coefficients 



Reflected Amplitude 
Incident Amplitude 

Transmitted Amplitude 
Incident Amplitude 

Reflected Energy 
Incident Energy 

Transmitted Energy 
Incident Energy 



Zi -Z 2 
Z\ + Z 2 



2Z| 

Z\ + Z2 



Z, - z 2 
Z\ + z 2 



2 



4Z t Z 2 

(z, +z 2 ) 2 



Impedance Matching 

Impedances Zi and Z3 are matched by insertion of impedance Z 2 where Z\ — Z1Z3 
Thickness of Z 2 is A/4 measured in Z 2 . 



Standing Waves. Normal Modes. Harmonics 

Solution of wave equation separates time and space dependence to satisfy time independent 
wave equation 



^-4 + k 2 y = 0 (cancel e larf ) 

Standing waves on string of length 1 have wavelength A„ where 



A n , 
n — — l 
2 



Displacement of nth harmonic is 



. . . . U! n X 

y n = (A n cos uj„t + B„ sin uj n t) sin 

c 



Energy of nth harmonic (string mass m) 



E n = KE n + PE n = -mu> 2 (A 2 + B 2 n ) 
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Group Vetocity 

In a dispersive medium the wave velocity v varies with frequency t o (wave number k). The 
energy of a group of such waves travels with the group velocity 



d w kdv dv 

dk dk dA 



Rectangular Wave Group of n Frequency Components Amplitude a. Width Aw, represented 
in time by 



m 



sin (Aw ■ t/2) 
sin (Aw ■ t/n ■ 2) 



cost ot 



where w is average frequency. R(t) is zero when 

Aw ■ t 



i.e. Bandwidth Theorem gives 



Aw ■ At = 2n 



or 



Ax A k = 2ir 

A pulse of duration At requires a frequency band width Aw to define it in frequency space 
and vice versa. 



Doppler Effect 

Signal of frequency v and velocity c transmitted by a stationary source S and received by a 
stationary observer O becomes 

, ISC 

v = 

c — u 

when source is no longer stationary but moves towards O with a velocity u. 




6 

Longitudinal Waves 

In deriving the wave equation 

d 2 y 1 d 2 y 
dx 2 c 2 dt 2 

in Chapter 5, we used the example of a transverse wave and continued to discuss waves of 
this type on a vibrating string. In this chapter we consider longitudinal waves, waves in 
which the particle or oscillator motion is in the same direction as the wave propagation. 
Longitudinal waves propagate as sound waves in all phases of matter, plasmas, gases, 
liquids and solids, but we shall concentrate on gases and solids. In the case of gases, 
limitations of thermodynamic interest are imposed; in solids the propagation will depend 
on the dimensions of the medium. Neither a gas nor a liquid can sustain the transverse 
shear necessary for transverse waves, but a solid can maintain both longitudinal and 
transverse oscillations. 

Sound Waves in Gases 

Let us consider a fixed mass of gas, which at a pressure P o occupies a volume Vo with a 
density p o. These values define the equilibrium state of the gas which is disturbed, or 
deformed, by the compressions and rarefactions of the sound waves. Under the influence of 
the sound waves 



and 



the pressure P o becomes P = P o + p 
the volume Vo becomes V = Vq + v 

the density p o becomes p — p o + pa- 



The excess pressure p m is the maximum pressure amplitude of the sound wave and p is an 
alternating component superimposed on the equilibrium gas pressure Pq. 
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The fractional change in volume is called the dilatation, written v/Vq = 6, and the 
fractional change of density is called the condensation, written Pd/ Po = s - The values of 6 
and ^ are « 10 ~ 3 for ordinary sound waves, and a value of p m = 2 x 10 5 Nm 2 (about 
10 1(1 of an atmosphere) gives a sound wave which is still audible at 1000 Hz. Thus, the 
changes in the medium due to sound waves are of an extremely small order and define 
limitations within which the wave equation is appropriate. 

The fixed mass of gas is equal to 

PoV 0 = P V = p Q Vo{l + 8){l + s) 

so that (l + 6 )(l+s)= 1, giving s = —6 to a very close approximation. The elastic 
property of the gas, a measure of its compressibility, is defined in terms of its bulk modulus 

d P d P 

B = — = -V — 

dV/V dV 

the difference in pressure for a fractional change in volume, a volume increase with fall in 
pressure giving the negative sign. The value of B depends on whether the changes in the gas 
arising from the wave motion are adiabatic or isothermal. They must be thermodynamically 
reversible in order to avoid the energy loss mechanisms of diffusion, viscosity and thermal 
conductivity. The complete absence of these random, entropy generating processes defines 
an adiabatic process, a thermodynamic cycle with a 100% efficiency in the sense that none 
of the energy in the wave, potential or kinetic, is lost. In a sound wave such thermodynamic 
concepts restrict the excess pressure amplitude; too great an amplitude raises the local 
temperature in the gas at the amplitude peaks and thermal conductivity removes energy 
from the wave system. Local particle velocity gradients will also develop, leading to 
diffusion and viscosity. 

Using a constant value of the adiabatic bulk modulus limits sound waves to small 
oscillations since the total pressure P — Pq +p is taken as constant; larger amplitudes lead 
to non-linear effects and shock waves, which we shall discuss separately in Chapter 15. 

All adiabatic changes in the gas obey the relation PV 7 = constant, where 7 is the ratio of 
the specific heats at constant pressure and volume, respectively. 

Differentiation gives 

V 7 dP + 7PV 7-1 dV = 0 
or 

d P 

— V — = 7 P = B a (where the subscript a denotes adiabatic) 
dV 

so that the elastic property of the gas is 7P, considered to be constant. Since P = P 0 + p, 
then dP = p , the excess pressure, giving 

B a = P 17 or p = —B a 8 = B a s 

v/Vo 

In a sound wave the particle displacements and velocities are along the x-axis and we 
choose the co-ordinate rj to define the displacement where r/(x, t). 

In obtaining the wave equation we consider the motion of an element of the gas of 
thickness Ax and unit cross section. Under the influence of the sound wave the behaviour 
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Ax 



r| + Ar| 

< > 



dP, 

dx 



AX 



Figure 6.1 Thin element of gas of unit cross-section and thickness Ax displaced an amount 77 and 
expanded by an amount ( 6 r]/dx)Ax under the influence of a pressure differene —(dP x /dx ) Ax 



of this element is shown in Figure 6. 1 . The particles in the layer x are displaced a distance ry 
and those at x + Ax are displaced a distance ry + Ary, so that the increase in the thick- 
ness Ax of the element of unit cross section (which therefore measures the increase in 
volume) is 



A A 

Ary = — Ax 
ox 



and 



6 = 



v 




—s 



where dp/8x is called the strain. 

The medium is deformed because the pressures along the x-axis on either side of the thin 
element are not in balance (Figure 6.1). The net force acting on the element is given by 



P x P x+ Ax — 



Px - P 



dP> 

dx 



Ax 






dp a 

- ,, Ax 
ox 



The mass of the element is poAx and its acceleration is given, to a close approxmation, by 
<9 2 ?y/d t 2 . 

From Newton’s Law we have 



dp 

dx 



Ax = poAx 



d 2 ri 

W 
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where 



so that 



P = ~B a 6 



-B n 



&il 

dx 



dp „ d 2 V 

dx a dx 2 ’ 



giving 



d 2 r] d 2 r] 

Ba lh 2 = P0 ~di 2 



But B a / po = 7 P/ pa is the ratio of the elasticity to the inertia or density of the gas, and this 
ratio has the dimensions 



force volume . 2 7 P 2 

= (velocity) - , so — — c 

area mass p 0 

where c is the sound wave velocity. 

Thus 

d 2 y 1 d 2 y 

dx 2 c 2 dt 2 



is the wave equation. Writing 77,,, as the maximum amplitude of displacement we have the 
following expressions for a wave in the positive x-direction: 



V = Vme i{u *~ kx) y = ^ = ivuy 
dt 

6 — — - = —iky = —s (so .9 = iky) 
dx 

p = B a s = iB a ky 



The phase relationships between these parameters (Figure 6.2a) show that when the wave is 
in the positive x-direction, the excess pressure p, the fractional density increase .9 and the 
particle velocity y are all 7t/2 rad in phase ahead of the displacement y, whilst the volume 
change (tr rad out of phase with the density change) is 7r/2 rad behind the displacement. 
These relationships no longer hold when the wave direction is reversed (Figure 6.2b); for a 
wave in the negative x-direction 



y — y m e i ( u * +foc ) r, = % = **! 

6 — — - = — iA'?? = —9 (so 9 = iky) 
dx 



p = B a s — —iB a ky 



In both waves the particle displacement y is measured in the positive x-direction and the 
thin element Ax of the gas oscillates about the value y — 0, which defines its central 
position. For a wave in the positive x-direction the value y = 0, with y a maximum in the 
positive x-direction, gives a maximum positive excess pressure ( compression ) with a 
maximum condensation s m (maximum density) and a minimum volume. For a wave in the 
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P,s,;) 



n 



91,8 

dx 



V 



> 



;; 



91,6 

dx 



Wave in +ve x 
direction 

(a) 



T AS 

Wave in -ve x 
direction 

(b) 



Figure 6.2 Phase relationships between the particle displacement tj, particle velocity i), excess 
pressure p and condensation s = —6 (the dilatation) for waves travelling in the positive and 
negative x directions. The displacement q is taken in the positive x direction for both waves 



negative x-direction, the same value // = 0, with fj a maximum in the positive x-direction, 
gives a maximum negative excess pressure ( rarefaction ), a maximum volume and a 
minimum density. To produce a compression in a wave moving in the negative x-direction 
the particle velocity p must be a maximum in the negative x-direction at r] = 0. This 
distinction is significant when we are defining the impedance of the medium to the waves. 
A change of sign is involved with a change of direction — a convention we shall also have 
to follow when discussing the waves of Chapters 7 and 8. 

Energy Distribution in Sound Waves 

The kinetic energy in the sound wave is found by considering the motion of the individual 
gas elements of thickness Ax. 

Each element will have a kinetic energy per unit cross section 

AEkin = ipo Ax ?) 2 

where rj will depend upon the position x of the element. The average value of the kinetic 
energy density is found by taking the value of f] 2 averaged over a region of n wavelengths. 
Now 

.27 r 

r ] = p m sin — (ct — x) 

A 

so that 

ITT _ Vi Jo A sin2 - x)/\ Ax _ j . 2 

1 ~ 77 A ~~ llm 

so that the average kinetic energy density in the medium is 

A£ kin = \povi = 
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(a simple harmonic oscillator of maximum amplitude a has an average kinetic energy over 
one cycle of ^mw 2 a 2 ). 

The potential energy density is found by considering the work P dV done on the fixed 
mass of gas of volume To during the adiabatic changes in the sound wave. This work is 
expressed for the complete cycle as 



AE pot = - 



PdV = - 



Y r 2-7T 

— pvd(ujt) 
2ft J o 



Pm^m 
2 : 



P_ 

Pm 



— = sin(u;£ — kx ) 

u m 



The negative sign shows that the potential energy change is positive in both 
a compression ( p positive, dV negative) and a rarefaction ( p negative, dV positive) 
Figure 6.3. 

The condensation 



J\lu 



Vo 




-h 



we write 



s 



s 



m 



-6 



sin (cut — kx) and — v = Vq^ 



which, with 



P = B a s 



gives 



o27T 

A £ pot = — - pvd(pjt) 

2?r Jo 



B a V Q r 
2tt Jo 



5 2 d(tuf) 




Figure 6.3 Shaded triangles show that potential energy = gained by gas in compression 
equals that gained in rarefaction when both p and v change sign 
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Figure 6.4 Energy distribution in space for a sound wave in a gas. Both potential and kinetic 
energies are at a maximum when the particle velocity f] is a maximum and zero at f/ = 0 

where s — —6 and the thickness Ax of the element of unit cross section represents its 
volume Vo- 
Now 



V = Vin e i(wr±fa) 



so that 



Thus 



_ <9 ? 7 1 drj 

dx c dt ' 



where c — — 
k 



1 B a , A 1 , A 

A£ pot = --^fj-Ax = - p Q r] 2 Ax 



and its average value over n A gives the potential energy density 

A£ pot = \povi 



We see that the average values of the kinetic and potential energy density in the sound 
wave are equal, but more important, since the value of each for the element Ax is 
\pqT] 2 Ax, we observe that the element possesses maximum (or minimum) potential and 
kinetic energy at the same time. A compression or rarefaction produces a maximum in the 
energy of the element since the value f] governs the energy content. Thus, the energy in the 
wave is distributed in the wave system with distance as shown in Figure 6.4. Note that this 
distribution is non-uniform with distance unlike that for a transverse wave. 



Intensity of Sound Waves 

This is a measure of the energy flux, the rate at which energy crosses unit area, so that it is 
the product of the energy density (kinetic plus potential) and the wave velocity c. Normal 
sound waves range in intensity between 10 12 and 1 W m 2 , extremely low levels which 
testify to the sensitivity of the ear. The roar of a large football crowd greeting a goal will 
just about heat a cup of coffee. 
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The intensity may be written 

/ = {pocril = \poCUt 2 T) 2 m = pocrilns = Pr m J POC = PnnsVrms 

A commonly used standard of sound intensity is given by 

7 0 = 10' 2 Wm' 2 

which is about the level of the average conversational tone between two people standing 
next to each other. Shouting at this range raises the intensity by a factor of 100 and in the 
range 100 Iq to 1000 Iq (10 W m~ 2 ) the sound is painful. 

Whenever the sound intensity increases by a factor of 10 it is said to have increased by 
1 B so the dynamic range of the ear is about 12 B. An intensity increase by a factor of 

10° 1 =1-26 

increases the intensity by 1 dB, a change of loudness which is just detected by a person 
with good hearing. dB is a decibel. 

We see that the product poc appears in most of the expressions for the intensity; its 
significance becomes apparent when we define the impedance of the medium to the waves 
as the 



excess pressure p 

Specific Acoustic Impedance = — — : — = - 

particle velocity p 



(the ratio of a force per unit area to a velocity). 
Now, for a wave in the positive x-direction. 



so that. 



p = B cl s = \B a kp and p = iuip 



P B a k B a 

. - = = PoC 

pi 0 c 



Thus, the acoustic impedance presented by the medium to these waves, as in the case of the 
transverse waves on the string, is given by the product of the density and the wave velocity 
and is governed by the elasticity and inertia of the medium. For a wave in the negative 
x-direction, the specific acoustic impedance 

P iB a kp 

~ = : = “POC 

p 1C op 

with a change of sign because of the changed phase relationship. 

The units of poc are normally stated as kg m 2 s ~ 1 in books on practical acoustics; in 
these units air has a specific acoustic impedance value of 400, water a value of 1.45x10 6 
and steel a value of 3.9 x 10 7 . These values will become more significant when we use them 
later in examples on the reflection and transmission of sound waves. 
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Although the specific acoustic impedance poc is a real quantity for plane sound waves, it 
has an added reactive component ik/r for spherical waves, where r is the distance travelled 
by the wavefront. This component tends to zero with increasing r as the spherical wave 
becomes effectively plane. 

(Problems 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8) 

Longitudinal Waves in a Solid 

The velocity of longitudinal waves in a solid depends upon the dimensions of the specimen 
in which the waves are travelling. If the solid is a thin bar of finite cross section the analysis 
for longitudinal waves in a gas is equally valid, except that the bulk modulus B a is replaced 
by Young’s modulus Y, the ratio of the longitudinal stress in the bar to its longitudinal 
strain. 

The wave equation is then 



d 2 r] 1 d 2 r] 
dx 2 c 2 dt 2 ’ 



with c 2 



Y 

P 



A longitudinal wave in a medium compresses the medium and distorts it laterally. 
Because a solid can develop a shear force in any direction, such a lateral distortion is 
accompanied by a transverse shear. The effect of this upon the wave motion in solids of 
finite cross section is quite complicated and has been ignored in the very thin specimen 
above. In bulk solids, however, the longitudinal and transverse modes may be considered 
separately. 

We have seen that the longitudinal compression produces a strain dq/dx\ the 
accompanying lateral distortion produces a strain 3(3/ dy (of opposite sign to dq/dx and 
perpendicular to the x-direction). 

Here (3 is the displacement in the y-direction and is a function of both x and y. The ratio 
of these strains 

3(3 jdr\ 
dy / dx a 

is known as Poisson's ratio and is expressed in terms of Lame’s elastic constants A and // 
for a solid as 

A crY 

cr — xvx r where A — — — — - 

2(A + p) (1 + ct)(1 — 2<t) 

These constants are always positive, so that a <v and is commonly « L In terms of these 
constants Young’s modulus becomes 

Y = (A + 2p - 2Acr) 

The constant p is the transverse coefficient of rigidity; that is, the ratio of the transverse 
stress to the transverse strain. It plays the role of the elasticity in the propagation of pure 
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Figure 6.5 Shear in a bulk solid producing a transverse wave. The transverse shear strain is df3/dx 
and the transverse shear stress is pdfi/dx, where p is the shear modulus of rigidity 



transverse waves in a bulk solid which Young’s modulus plays for longitudinal waves in 
a thin specimen. Figure 6.5 illustrates the shear in a transverse plane wave, where the 
transverse strain is defined by d/3/dx. The transverse stress at x is therefore T x — // 33 / dx. 
The equation of transverse motion of the thin element dx is then given by 



where p is the density, or 



but y — d 2 /3/dt 2 , hence 



T x +tLx — T c Lv = pdxy 



d_ 

dx 




py 



d 2 f3 p d 2 p 

~dx 2 ~~P~di 2 

the wave equation with a velocity given by c 2 = p/p. 

The effect of the transverse rigidity p is to stiffen the solid and increase the elastic 
constant governing the propagation of longitudinal waves. In a bulk solid the velocity of 
these waves is no longer given by c 2 = Y / p, but becomes 

j AT 2p 

c = - 

P 



Since Young’s modulus Y = X + 2p — 2Xa , the elasticity is increased by the amount 
2\a k. A, so that longitudinal waves in a bulk solid have a higher velocity than the same 
waves along a thin specimen. 

In an isotropic solid, where the velocity of propagation is the same in all directions, 
the concept of a bulk modulus, used in the discussion on waves in gases, holds equally 
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well. Expressed in terms of Lame’s elastic constants the bulk modulus for a solid is 
written 



B = \ + \n= F[3(l — 2 <t)] 



the longitudinal wave velocity for a bulk solid becomes 



^+( 4 / 3 )^ 1/2 



whilst the transverse velocity remains as 




Application to Earthquakes 

The values of these velocities are well known for seismic waves generated by earthquakes. 
Near the surface of the earth the longitudinal waves have a velocity of 8 km s _1 and the 
transverse waves travel at 4.45 km s -1 . The velocity of the longitudinal waves increases 
with depth until, at a depth of about 1800 miles, no waves are transmitted because of a 
discontinuity and severe mismatch of impedances associated with the fluid core. 

At the surface of the earth the transverse wave velocity is affected by the fact that stress 
components directed through the surface are zero there and these waves, known as 
Rayleigh Waves, travel with a velocity given by 



c =/(ct) 




1/2 



where 



f(a) = 0.9194 when cr = 0 ■ 25 



and 



f(a) = 0.9553 when <r = 0 ■ 5 

The energy of the Rayleigh Waves is confined to two dimensions; their amplitude is 
often much higher than that of the three dimensional longitudinal waves and therefore they 
are potentially more damaging. 

In an earthquake the arrival of the fast longitudinal waves is followed by the Rayleigh 
Waves and then by a complicated pattern of reflected waves including those affected by the 
stratification of the earth’s structure, known as Love Waves. 
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(Problem 6.9) 

Longitudinal Waves in a Periodic Structure 

Lame’s elastic constants, A and /j, which are used to define such macroscopic quantities as 
Young’s modulus and the bulk modulus, are themselves determined by forces which 
operate over interatomic distances. The discussion on transverse waves in a periodic 
structure has already shown that in a one-dimensional array representing a crystal lattice a 
stiffness s = T/a dyn cm -1 can exist between two atoms separated by a distance a. 

When the waves along such a lattice are longitudinal the atomic displacements from 
equilibrium are represented by 77 (Figure 6 . 6 ). An increase in the separation between two 
atoms from a to a + 77 gives a strain e = 77 /a, and a stress normal to the face area a 2 of a 
unit cell in a crystal equal to srj/ a 2 = se/a, a force per unit area. 

Now Young’s modulus is the ratio of this longitudinal stress to the longitudinal strain, so 
that Y = se/sa or s — Ya. The longitudinal vibration frequency of the atoms of mass m 
connected by stiffness constants s is given, very approximately by 




where m = pa 2 and c 0 is the velocity of sound in a solid. The value of 
Co ~ 5 x 10 3 m s _1 , and aw 2 x 10 ~ 10 m, so that v w 3 x 10 12 Hz, which is almost 
the same value as the frequency of the transverse wave in the infrared region of the 
electromagnetic spectrum. The highest ultrasonic frequency generated so far is about a 
factor of 10 lower than v = co/ltta. At frequencies w 5 x 10 12 to 10 13 Hz many 
interesting experimental results must be expected. A more precise mathematical treatment 
yields the same equation of motion for the r th particle as in the transverse wave; 
namely 



mfj r = *7(?7 r+t + Vr - 1 - 2rir) 



where s — T/a and 



V r — ?7max 
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Figure 6.6 Displacement of atoms in a linear array due to a longitudinal wave in a crysal structure 
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The results are precisely the same as in the case of transverse waves and the shape of the 
dispersion curve is also similar. The maximum value of the cut-off frequency ut m is, 
however, higher for the longitudinal than for the transverse waves. This is because the 
longitudinal elastic constant Y is greater than the transverse constant /j; that is, the force 
required for a given displacement in the longitudinal direction is greater than that for 
the same displacement in the transverse direction. 



Reflection and Transmission of Sound Waves at Boundaries 



When a sound wave meets a boundary separating two media of different acoustic 
impedances two boundary conditions must be met in considering the reflection and 
transmission of the wave. They are that 



and 



(i) the particle velocity fj 



(ii) the acoustic excess pressure p 

are both continuous across the boundary. Physically this ensures that the two media are in 
complete contact everywhere across the boundary. 

Figure 6.7 shows that we are considering a plane sound wave travelling in a medium of 
specific acoustic impedance Z\ = p\C\ and meeting, at normal incidence, an infinite plane 
boundary separating the first medium from another of specific acoustic impedance 
Z 2 = p 2 c 2 ■ If the subscripts i, r and t denote incident, reflected and transmitted 
respectively, then the boundary conditions give 

Vi + Vr = Vt (6.1) 

and 



Pi+Pr=Pt (6.2) 

For the incident wave p x = piCifi and for the reflected wave p r = —p\C\f] r , so equation 
(6.2) becomes 

P\C ifi ~ p\C \ fjx — P2C2Vt 



incident 

► transmitted 



reflected 



Pi f'l P2 ^2 



Figure 6.7 Incident, reflected and transmitted sound waves at a plane boundary between media of 
specific acoustic impedances p\C\ and p 2 c 2 
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or 



Zi77i - 2i?7r = Z 2 ?) t 



(6.3) 



Eliminating fj t from (6.1) and (6.3) gives 

Vi _ UVr _ Vr _ Z\ ~ Z 2 
Vi uVi Vi Z[+Z 2 

Eliminatiing // r from (6.1) and (6.3) gives 

Vi_Vt_ 2Z i 
Vi Vi Zl+Z 2 

Now 

P r _ Zjjr _ Z 2 ~ 2i _ _?)r 
Pi Z\fji Zi+Z 2 r/i 

and 

_ Z 2 7) t _ 2 Z 2 

Pi Z i ?>i Zi + Z 2 

We see that if Zi > Z 2 the incident and reflected particle velocities are in phase, whilst the 
incident and reflected acoustic pressures are out of phase. The superposition of incident and 
reflected velocities which are in phase leads to a cancellation of pressure (a pressure node 
in a standing wave system). If Z\ < Z 2 the pressures are in phase and the velocities are out 
of phase. 

The transmitted particle velocity and acoustic pressure are always in phase with their 
incident counterparts. 

At a rigid wall, where Z 2 is infinite, the velocity // = 0 — 7], + r/ r , which leads to a 
doubling of pressure at the boundary. (See Summary on p. 546.) 

Reflection and Transmission of Sound Intensity 

The intensity coefficients of reflection and transmission are given by 

ir = Zi(v;) Tms = ( Zj-z 2 \ 2 
li Zl(V?)nns \Zi+Z 2 J 

and 

Ii_ z 2 (v;) ims _z 2 / 2Z, \ 2 _ 4Z t Z 2 
li Z\ (flf ) rms Z\ \Zi+Z 2 y (Zi+Z 2 ) 2 

The conservation of energy gives 

7 r I t 

i + T= 1 or 7i=/, + / r 
' i ^ i 
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The great disparity between the specific acoustic impedance of air on the one hand and 
water or steel on the other leads to an extreme mismatch of impedances when the 
transmission of acoustic energy between these media is attempted. 

There is an almost total reflection of sound wave energy at an air-water interface, 
independent of the side from which the wave approaches the boundary. Only 14% of 
acoustic energy can be transmitted at a steel-water interface, a limitation which has severe 
implications for underwater transmission and detection devices which rely on acoustics. 

(Problems 6.10, 6.11, 6.12, 6.13, 6.14, 6.15, 6.16, 6.17) 



Problem 6.1 

Show that in a gas at temperature T the average thermal velocity of a molecule is approximatley 
equal to the velocity of sound. 

Problem 6.2 

The velocity of sound in air of density 1.29 kg m -3 may be taken as 330 m s -1 . Show that the 
acoustic pressure for the painful sound of 10 W nW 2 « 6.5 x 10 of an atmosphere. 

Problem 6.3 

Show that the displacement amplitude of an air molecule at a painful sound level of 10 W m -2 at 
500 Hz « 6.9 x 10 _5 m. 

Problem 6.4 

Barely audible sound in air has an intensity of 10~ lo /o- Show that the displacement amplitude of an 
air molecule for sound at this level at 500 Hz is ~ 10 -10 m; that is, about the size of the molecular 
diameter. 



Problem 6.5 

Hi-fi equipment is played very loudly at an intensity of 100 7o in a small room of cross section 
3 m x 3 m. Show that this audio output is about 10 W. 

Problem 6.6 

Two sound waves, one in water and one in air, have the same intensity. Show that the ratio of their 
pressure amplitudes ( p water tp air) is about 60. When the pressure amplitudes are equal show that 
the intensity ratio is w 3 x 10 ~ 2 . 



Problem 6.7 

A spring of mass m, stiffness s and length L is stretched to a length L + l. When longitudinal waves 
propagate along the spring the equation of motion of a length dr may be written 



pdx 



d 2 r i 
lit 2 



dF 

dx 



dv 



where p is the mass per unit length of the spring, p is the longitudinal displacement and F is the 
restoring force. Derive the wave equation to show that the wave velocity v is given by 



v 2 = s(L + /)/ p 
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Problem 6.8 

In Problem 1.10 we showed that a mass M suspended by a spring of stiffness s and mass m oscillated 
simple harmonically at a frequency given by 



M + m/3 

We may consider the same problem in terms of standing waves along the vertical spring with 
displacement 



r] = (A cos kx + B sin kx ) sin uit 

where k = u)/v is the wave number. The boundary conditions are that r] = 0 at x = 0 (the top of the 
spring) and 



d 2 v dn 

M— y=-sL— at x = L 
ot z ox 



(the bottom of the spring). Show that these lead to the expression 



kL tan kL = 



M 



and expand tan kL in powers of kL to show that, in the second order approximation 



M + m/3 



The value of v is given in Problem 6.7. 



Problem 6.9 

A solid has a Poissons ratio a = 0.25. Show that the ratio of the longitudinal wave velocity to the 
transverse wave velocity is y/3. Use the values of these velocities given in the text to derive an 
appropriate value of a for the earth. 

Problem 6.10 

Show that when sound waves are normally incident on a plane steel water interface 86% of the 
energy is reflected. If the waves are travelling in water and are normally incident on a plane water-ice 
interface show that 82.3% of the energy is transmitted. 

(pc values in kg m~ 2 s -1 ) 

water = 1.43 x 10 6 
ice = 3.49 x 10 6 
steel = 3.9 x 10 7 
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Problem 6.11 

Use the boundary conditions for standing acoustic waves in a tube to confirm the following: 





Particle displacement 


Pressure 




closed end 


open end 


closed end 


open end 


Phase change on reflection 


180° 


0 


0 


180° 


node 


antinode 


antinode 


node 



Problem 6.12 

Standing acoustic waves are formed in a tube of length 1 with (a) both ends open and (b) one end 
open and the other closed. If the particle displacement 

77 = (A cos kx + B sin kx) sin uit 

and the boundary conditions are as shown in the diagrams, show that for 

(a) r) = A cos kx sin ut with A = 2 l/n 



and for 



(b) rj = A cos kx sin u>t with A = 4//(2« + 1) 
Sketch the first three harmonics for each case. 



(a) 



(b) 



fUo 

dx 



fUo 

dx 



fU 0 

dx 



*7 = 0 



Problem 6.13 

On p. 121 we discussed the problem of matching two strings of impedances Z\ and Z 3 by the 
insertion of a quarter wave element of impedance 

z 2 = (z l z 3 ) 1/2 

Repeat this problem for the acoustic case where the expressions for the string displacements 

AiOriAt 



now represent the appropriate acoustic pressures p,, p r and p t . 

Show that the boundary condition for pressure continuity at x = 0 is 



Ai + Z?i — A2 + R2 



and that for continuity of particle velocity is 

Z 2 (A 1 -S 1 )=Z 1 (A 2 -S 2 ) 
Similarly, at x = /, show that the boundary conditions are 

A 2 e _i * 2 ' + B 2 c ik2 ' =A 3 
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and 

Z 3 {A 2 c~ ikil - B 2 ^ 1 ) = Z 2 A 3 

Hence prove that the coefficient of sound transmission 

Zi4|_ , 

z 3 a\ 

when 

Zs,=Z\Z 3 and / = — 

4 

(Note that the expressions for both boundary conditions and transmission coefficient differ from 
those in the case of the string.) 

Problem 6.14 

For sound waves of high amplitude the adiabatic bulk modulus may no longer be considered as a 
constant. Use the adiabatic condition that 




P o |_V 0 (1 + 6)J 



in deriving the wave equation to show that each part of the high amplitude wave has its 
own sound velocity co(l + .S') !7+l ^ 2 , where c q = jPq/pq, S is the dilatation, .s the condensation and 
7 the ratio of the specific heats at constant pressure and volume. 

Problem 6.15 

Some longitudinal waves in a plasma exhibit a combination of electrical and acoustical phenomena. 
They obey a dispersion relation at temperature T of cu 2 = ui 2 e + 3 aTk 2 , where ui e is the constant 
electron plasma frequency (see Problem 5.18) and the Boltzmann constant is written as a to avoid 
confusion with the wave number k. Show that the product of the phase and group velocities is related 
to the average thermal energy of an electron (found from pV = RT ). 



Problem 6.16 

It is possible to obtain the wave equation for tidal waves (long waves in shallow water) by the 
method used in deriving the acoustic wave equation. In the figure a constant mass of fluid in an 
element of unit width, height h and length Ax moves a distance rj and assumes 
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a new height h + a and length (1 + dt]dx)Ax, but retains unit width. Show that, to a first 
approximation. 



a = 




Neglecting surface tension, the force on the element face of height h + a arises from the product of 
the height and the mean hydrostatic pressure. Show, if pgh <C P o (he- h <C 10 m) and a <C h, that 
the net force on the liquid element is given by 



OF da 

~ -s- A.v = -pgh — Ax 
ox Ox 



Continue the derivation using the acoustic case as a model to show that these waves are non- 
dispersive with a phase velocity given by v 2 = gh. 



Problem 6.17 

Waves near the surface of a non-viscous incompressible liquid of density p have a phase velocity 
given by 



v 2 (k) 




tanh kh 



where g is the acceleration due to gravity, T is the surface tension, k is the wave number and h is the 
liquid depth. When h -C A the liquid is shallow; when /i> A the liquid is deep. 



(a) Show that, when gravity and surface tension are equally important and li 3> A, the wave velocity 
is a minimum at ir = AgT / p, and show that this occurs for a ‘critical’ wavelength 

A r = 2n{T/pg) l/2 . 

(b) The condition A>A f defines a gravity wave, and surface tension is negligible. Show that 
gravity waves in a shallow liquid are non-dispersive with a velocity v = \fgh (see Problem 
6.16). 

(c) Show that gravity waves in a deep liquid have a phase velocity v = \fgjk and a group velocity 
of half this value. 

(d) The condition A < A t defines a ripple (dominated by surface tension). Show that short ripples in 
a deep liquid have a phase velocity v = y/Tk/p and a group velocity of | v. (Note the anomalous 
dispersion). 



Summary of Important Results 



Wave Velocity 



2 Bulk Modulus 7 P 

P P 
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Zi Reflected pressure 
Z2 Incident pressure 

TZ\ Z\ Transmitted pressure 

= = — x 

Z1+Z1 Z2 Incident pressure 




7 

Waves on Transmission Lines 



In the wave motion discussed so far four major points have emerged. They are 

1. Individual particles in the medium oscillate about their equilibrium positions with 
simple harmonic motion but do not propagate through the medium. 

2. Crests and troughs and all planes of equal phase are transmitted through the medium to 
give the wave motion. 

3. The wave or phase velocity is governed by the product of the inertia of the medium and 
its capacity to store potential energy; that is, its elasticity. 

4. The impedance of the medium to this wave motion is governed by the ratio of the 
inertia to the elasticity (see table on p. 546). 

In this chapter we wish to investigate the wave propagation of voltages and currents and 
we shall see that the same physical features are predominant. Voltage and current waves are 
usually sent along a geometrical configuration of wires and cables known as transmission 
lines. The physical scale or order of magnitude of these lines can vary from that of an 
oscilloscope cable on a laboratory bench to the electric power distribution lines supported 
on pylons over hundreds of miles or the submarine telecommunication cables lying on an 
ocean bed. 

Any transmission line can be simply represented by a pair of parallel wires into one end 
of which power is fed by an a.c. generator. Figure 7.1a shows such a line at the instant 
when the generator terminal A is positive with respect to terminal B, with current flowing 
out of the terminal A and into terminal B as the generator is doing work. A half cycle later 
the position is reversed and B is the positive terminal, the net result being that along each of 
the two wires there will be a distribution of charge as shown, reversing in sign at each half 
cycle due to the oscillatory simple harmonic motion of the charge carriers (Figure 7.1b). 
These carriers move a distance equal to a fraction of a wavelength on either side of their 
equilibrium positions. As the charge moves current flows, having a maximum value where 
the product of charge density and velocity is greatest. 



The Physics of Vibrations and Waves, 6th Edition H.J.Pain 

© 2005 J ohn Wiley & Sons, Ltd., ISBN: 0-470-01295-l(hardback); 0-470-01296-X (paperback) 



171 




172 



Waves on Transmission Lines 



(a) 



(b) 





Energy 

flow 



Inductive 

coupling 




Neon lamp 
glows brightly 
at voltage maxima 



Figure 7.1 Power fed continuously by a generator into an infinitely long transmission line. Charge 
distribution and voltage waves for (a) generator terminal positive at A and (b) a half period later, 
generator terminal positive at B. Laboratory demonstration (c) of voltage maxima along a Lecher wire 
system. The neon lamp gLows when held near a position of V mM 



The existence along the cable of maximum and minimum current values varying simple 
harmonically in space and time describes a current wave along the cable. Associated with 
these currents there are voltage waves (Figure 7.1a), and if the voltage and current at the 
generator are always in phase then power is continuously fed into the transmission line and 
the waves will always be carrying energy away from the generator. In a laboratory the 
voltage and current waves may be shown on a Lecher Wire sysem (Figure 7.1c). 

In deriving the wave equation for both voltage and current to obtain the velocity of wave 
propagation we shall concentrate our attention on a short element of the line having a 
length very much less than that of the waves. Over this element we may consider the 
variables to change linearly to the first order and we can use differentials. 

The currents which flow will generate magnetic flux lines which thread the region 
between the cables, giving rise to a self inductance Lq per unit length measured in henries 
per metre. Between the lines, which form a condenser, there is an electrical capacitance Co 
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per unit length measured in farads per metre. In the absence of any resistance in the line 
these two parameters completely describe the line, which is known as ideal or lossless. 



Ideal or Lossless Transmission Line 

Figure 7.2 represents a short element of zero resistance of an ideal transmission line length 
dr <C A (the voltage or current wavelength). The self inductance of the element is Lq dr 
and its capacitance if Co dr F. 

If the rate of change of voltage per unit length at constant time is dV/dx, then the 
voltage difference between the ends of the element dr is dV/dxdx. which equals the 
voltage drop from the self inductance — (Lq dx)dl/dt. 

Thus 



dV 

— d.r = -(L 0 dr) 



m 

dt 



or 



dV_ ffl 



(7.1) 



If the rate of change of current per unit length at constant time is dl/dx there is a loss of 
current along the length dr of —dl/dxdx because some current has charged the capacitance 
Co dr of the line to a voltage V. 

If the amount of charge is q = (Co dr) V, 



so that 



d ad, 

d ' = d7 = ft (c “ 4v)v 



—dl d 
-dv = s (c„d„r 



L 0 dx 

'j-^nnnr^- 

V C 0 dx 



< dx *- 

Figure 7.2 Representation of element of an ideal transmission line of inductance L 0 H per unit 
Length and capacitance C 0 F per unit length. The element length <tc A, the voltage and current 
wavelength 



l+^L dx 
dx 



l/+-|^dx 
i dx 
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or 



-dl dV 
~dx ~ C °~dt 



(7.2) 



Since d 2 / dxdt = d 2 / dt dx it follows, by taking d/dx of equation (7.1) and d/dt of 
equation (7.2) that 



d 2 V d 2 V 

~dx 2 ~ oCo W 



(7.3) 



a pure wave equation for the voltage with a velocity of propagation given by v 2 
Similarly d/dt of (7.1) and d/dx of (7.2) gives 



d 2 I _ d 2 I 
d^~ oCo W 



1/LqCo. 



(7.4) 



showing that the current waves propagate with the same velocity v 2 = 1/LqCo- We must 
remember here, in checking dimensions, that Lq and Co are defined per unit length. 

So far then, the oscillatory motion of the charge carriers (our particles in a medium) has 
led to the propagation of voltage and current waves with a velocity governed by the product 
of the magnetic inertia or inductance of the medium and its capactiy to store potential 
energy. 



Coaxial Cables 



Many transmission lines are made in the form of coaxial cables, e.g. a cylinder of dielectric 
material such as polythene having one conductor along its axis and the other surrounding 
its outer surface. This configuration has an inductance per unit length of 



, T , r 2 
Lq = — log e — H 
27 r r i 

where r \ and r 2 are the radii of the inner and outer conductors respectively and // is the 
magnetic permeability of the dielectric (henries per metre). Its capacitance per unit length 

27 re „ 



C 0 = 



log e r 2 /ri 



where e is the permittivity of the dielectric (farads per metre) so that v 2 = 1/LqCo = 1 //re. 

The velocity of the voltage and current waves along such a cable is wholly determined 
by the properties of the dielectric medium. We shall see in the next chapter on 
electromagnetic waves that fi and e represent the inertial and elastic properties of any 
medium in which such waves are propagating; the velocity of these waves will be given by 
v 2 = 1 /lie. In free space these parameters have the values 

fio = x 10 _7 Hm _1 
£0 — (367T x 10 9 )~’ Fm _1 

and v 2 becomes c 2 = (/uq£o) * where c is the velocity of light, equal to 3 x 10 8 m s -1 . 
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As we shall see in the next section the ratio of the voltage to the current in the waves 
travelling along the cable is 




where Z () defines the impedance seen by the waves moving down an infinitely long cable. It 
is called the Characteristic Impedance. 

We write e = e r e o where e r is the relative permittivity (dielectric constant) of a material 
and /j = //, /io, where /./,,■ is the relative permeability. Polythene, which commonly fills the 
space between r\ and r 2 , has e r ~ 10 and p r ~ 1. 

Hence 



where 




1 

27T 




1 1 , r 2 [pf 

b / l0 Se — \/ 

27T yje r 1 V So 



/^= 376.6 Q 

V £ o 



Typically, the ratio r 2 /r\ varies between 2 and 10 2 and for a laboratory cable using 
polythene Zq « 50 — 75 0 with a signal speed « c/3 where c is the speed of light. 

Coaxial cables can be made to a very high degree of precision and the time for an 
electrical signal to travel a given length can be accurately calculated because the velocity is 
known. 

Such a cable can be used as a ‘delay line’ in order to separate the arrival of signals at a 
given point by very small intervals of time. 

Characteristic Impedance of a Transmission Line 

The solutions to equations (7.3) and (7.4) are, of course, 

27T 

V + = Vq+ sin — {vt — x) 

A 

and 

2tt 

/+ = I o+ sin — (vt — x) 

A 

where Vo and / o are the maximum values and where the subscript + refers to a wave 
moving in the positive x-direction. Equation (7.1), dV/dx= —Lodl/dt, therefore gives 
— V' + = —vLoI' + , where the superscript refers to differentiation with respect to the bracket 
(vt — x). 

Integration of this equation gives 



V+ - vLoI+ 
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where the constant of integration has no significance because we are considering only 
oscillatory values of voltage and current whilst the constant will change merely the d.c. 
level. 

The ratio 




and the value of sJLq/Cq, written as Zq, is a constant for a transmission line of given 
properties and is called the characteristic impedance. Note that it is a pure resistance 
(no dimensions of length are involved) and it is the impedance seen by the wave 
system propagating along an infinitely long line, just as an acoustic wave experiences a 
specific acoustic impedance pc. The physical correspondence between pc and 
Lqv=^Lq/Cq = Z() is immediately evident. 

The value of Zq for the coaxial cable considered earlier can be shown to be 





n. 

r\ 



Electromagnetic waves in free space experience an impedance Zq = \J po/ £ o — 376.6 Q. 

So far we have considered waves travelling only in the x-direction. Waves which travel 
in the negative x-direction will be represented (from solving the wave equation) by 

2 d J[ 

V_ = Vo- sin — (vt + x) 

A 

and 

2tt 

I = I o- sin — (vt + x) 

A 

where the negative subscript denotes the negative x-direction of propagation. 

Equation (7.1) then yields the results that 

- — = —vLq = —Zq 



so that, in common with the specific acoustic impedance, a negative sign is introduced into 
the ratio when the waves are travelling in the negative x-direction. 

When waves are travelling in both directions along the transmission line the total voltage 
and current at any point will be given by 

v = v + + V- 

and 

/ = /+ + /_ 

When a transmission line has waves only in the positive direction the voltage and current 
waves are always in phase, energy is propagated and power is being fed into the line by the 
generator at all times. This situation is destroyed when waves travel in both directions; 
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waves in the negative x-direction are produced by reflection at a boundary when a line is 
terminated or mismatched; we shall now consider such reflections. 

(Problems 7.1, 7.2) 

Reflections from the End of a Transmission Line 

Suppose that a transmission line of characteristic impedance Z () has a finite length and that 
the end opposite that of the generator is terminated by a load of impedance Z L as shown in 
Figure 7.3. 

A wave travelling to the right (V + , 7 + ) may be reflected to produce a wave ( V . I ) 
The boundary conditions at Zl must be V+ + V- — V l, where V \ is the voltage across the 
load and 7 + +7_=7l- In addition V + /I + =Zq, V-/I- = — Zo and Vl/Il = Z l. It is easily 
shown that these equations yield 

V- Z L - Zp 

v+ Z L + Z„ 

(the voltage amplitude reflection coefficient), 

7- _ Zp - Z L 
7 + Zl + Zo 

(the current amplitude reflection coefficient), 

Vl 2Z l 
v+ Z L + Zo 



and 



7l _ 2Z 0 

-7+ Zl + Zo 

in complete correspondence with the reflection and transmission coefficients we have met 
so far. (See Summary on p. 546.) 



4 






(Y-./J 




v + +v_= v L 
/ ++ /_=/ L 
Y + ~V_ 



Figure 7.3 Transmission Line terminated by impedance Z l to produce reflected waves unless 
Z l = Z 0 , the characteristic impedance 
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We see that if the line is terminated by a load Zl = Zq, its characteristic impedance, the 
line is matched, all the energy propagating down the line is absorbed and there is no 
reflected wave. When Z L = Zq, therefore, the wave in the positive direction continues to 
behave as though the transmission line were infinitely long. 



Short Circuited Transmission Line (Z L = 0) 

If the ends of the transmission line are short circuited (Figure 7.4), Z L = 0, and we have 

V L = V+ + V_ = 0 

so that V + — —V-, and there is total reflection with a phase change of 7 r. But this is the 
condition, as we saw in an earlier chapter, for the existence of standing waves; we shall see 
that such waves exist on the transmission line. 

At any position x on the line we may express the two voltage waves by 

V+ = Z 0 /+ = V 0+ e i(w, “ fa) 

and 

V_ = — Z 0 /_ = Vo- e i(uJ,+fa) 

where, with total reflection and 7r phase change, Vo+ = — Vo-- The total voltage at x is 

V x = (V+ + V - ) = V 0+ (e~ ifa - e ikx ) e iut = (-i)2V 0+ sin kxe iuJt 

and the total current at x is 

- (/+ + I-) = — (e- ifa + e ifa ) e iurt = cos kxe iut 
Zq Zq 

We see then that at any point x along the line the voltage V x varies as sin kx and the 
current I x varies as cos kx, so that voltage and current are 90° out of phase in space. In 
addition the — i factor in the voltage expression shows that the voltage lags the current 90° 
in time, so that if we take the voltage to vary with cos cut from the e'“' term, then the current 



Voltage 




Current 



Figure 7.4 Short circuited transmission Line of length (2 n + l)A/4 produces a standing wave with 
a current maximum and zero voltage at end of line 
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will vary with — sin ujt. If we take the time variation of voltage to be as sin ujt the current 
will change with cos cut. 

Voltage and current at all points are 90° out of phase in space and time, and the power 
factor cos cf> = cos 90° = 0, so that no power is consumed. A standing wave system exists 
with equal energy propagated in each direction and the total energy propagation equal to 
zero. Nodes of voltage and current are spaced along the transmission line as shown in 
Figure 7.4, with / always a maximum where V = 0 and vice versa. 

If the current I varies with cos iot it will be at a maximum when V = 0; when V is a 
maximum the current is zero. The energy of the system is therefore completely exchanged 
each quarter cycle between the magnetic inertial energy \LqI 2 and the electric potential 
energy \ C 0 V 2 . 

(Problems 7.3, 7.4, 7.5, 7.6, 7.7, 7.8, 7.9, 7.10, 7.11) 

The Transmission Line as a Filter 

The transmission line is a continuous network of impedances in series and parallel 
combination. The unit section is shown in Figure 7.5(a) and the continuous network in 
Figure 7.5(b). 

Zi 



Z 2 



(a) 

Zi Z, z. 



Z 2 



(b) 

Figure 7.5 (a) The elementary unit of a transmission Line, (b) A transmission line formed by a 

series of such units 
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Figure 7.6 A infinite series of elemenetary units presents a characteristic impedance Z 0 to a 
wave travelling down the transmission line. Adding an extra unit at the input terminal leaves Z 0 
unchanged 



If we add an infinite series of such sections a wave travelling down the line will meet its 
characteristic impedance Zq. Figure 7.6 shows that, adding an extra section to the beginning 
of the line does not change Zq. The impedance in Figure 7.6 is 



Z = Zi + 





-l 



or 



Z = Zj + 



Z2Z0 

Z 2 + Zq 



— Zq 



so the characteristic impedance is 



Z 0 



Zi 

2 




ZiZ 2 



Note that Z\/2 is half the value of the first impedance in the line so if we measure the 
impedance from a point half way along this impedance we have 

(Z\ \ 1/2 

z 0 = ( 4 ' wzrzij 



We shall, however, use the larger value of Zo in what follows. 

In Figure 7.7 we now consider the currents and voltages at the far end of the transmission 
line. Any V n since it is across Zq is given by V„ = I„Zq 
Moreover 

v„-v n+1 =I n Z l = v n % 

Z() 





The Transmission Line as a Filter 



181 




So 

V„+] _ ^ _ Z] _ Zo — Z\ 
V n Z 0 Z 0 

a result which is the same for all sections of the line. 

We define a propagation factor 

_ V „+ 1 _ Zo — Z] 

" _ _ z 0 

which, with 






1/2 




In all practical cases Zj/Zt is real since 

1. there is either negligible resistance so that Z\ and Z 2 are imaginary 
or 

2. the impedances are purely resistive. 
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So, given (1) or (2) we see that if 

2 



(a) 


(■** 


(b) 


0*t 



Z\ 



l 



z, 

4Z 2 



> 1 



2 

1 < 1 



then a is real, and 



then a is complex. 



For a real we have Z\ /4Z 2 >0 or < — 1. 

If Zi/ 4Z 2 > 0, then 0 < a < 1, the currents in successive sections decrease 
progressively and since a is real and positive there is no phase change from one section 
to another. 

If Zi/ 4Z 2 < — 1 , then o < 0, and there is again a progressive decrease in current 
amplitudes along the network but here a is negative and there is a ir phase change for each 
successive section. 

When a is complex we have 



and 



Zi 

— 1 < —7- < 0 

4Z 2 



a = 1 



A 

2Z 2 




1/2 



Note that led = 1 so we can write 



a = cos/3 — i sin /3 — e 



where 



cos /3 



1 + 



Zi 

2Z 2 



The current amplitude remains constant along the transmission line but the phase is 
retarded by /3 with each section. If Z\ and Z 2 are purely resistive a is fixed and the 
attenuation is constant for all voltage inputs. 

If Z] is an inductance with Z 2 a capacitance (or vice versa) the division between a real 
and a complex occurs at certain frequencies governed by their relative magnitudes. 

If Z\ — it oL and Z 2 = I /\ujC for an input voltage V = Vye'^' then a) = I when 
0 < u?LC < 4. 

So the line behaves as a low pass filter with a cut-off frequency lo c = 2 js/LC Above this 
frequency there is a progressive decrease in amplitude with a phase change of tt in each 
section. Figure 7.8a. 

If the positions of Z\ and Z 2 are now interchanged so that Z\ = 1/itcC is now a 
capacitance and Z 2 is now an inductance with Z 2 = \ojL the transmisson line becomes a 
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Figure 7.8 (a) When Zj = iuiL and Z 2 = (iu>L) 1 the transmission Line acts as a Low-pass fiLter. (b) 

Reversing the positions of Z\ and Z 2 changes the transmission Line into a high-pass fiLter 



high pass filter with zero attenuation for 0 < 1 /urLC < 4 that is for all frequencies above 
t o c = {l/ly/LC) Figure 7.8b. 

(Problem 7.12) 



Effect of Resistance in a Transmission Line 

The discussion so far has concentrated on a transmission line having only inductance and 
capacitance, i.e. wattless components which consume no power. In practice, of course, no 
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I 



V 



L 0 dx 

nwrnr- 



i 



R 0 dx 

— 

C 0 dx 




/+ dx 
— >- 3x 



t 



dx 

1 



Figure 7.9 Real transmission Line element includes a series resistance R 0 ft per unit length and a 
shunt conductance G 0 S per unit length 



such line exists: there is always some resistance in the wires which will be responsible for 
energy losses. We shall take this resistance into account by supposing that the transmission 
line has a series resistance RqQ per unit length and a short circuiting or shunting resistance 
between the wires, which we express as a shunt conductance (inverse of resistance) written 
as Go, where Go has the dimensions of siemens per metre. Our model of the short element 
of length dr of the transmission line now appears in Figure 7.9, with a resistance Rq dx in 
series with L 0 dx and the conductance Godx shunting the capacitance Codx. Current will 
now leak across the transmission line because the dielectric is not perfect. We have seen 
that the time-dependence of the voltage and current variations along a transmission line 
may be written 



V = Vq e luJf and 7 = / 0 e 1 ‘ 



so that 



dl dV 

Lq tt = luJ L 0 1 and C 0 — = iujCqV 
at at 



The voltage and current changes across the line element length dx are now given by 

(7.1a) 
(7.2a) 



dV dl , 

— = —Lq — — RqI — — (R o + iu>Lo)I 



dl dV r 

— = — Cq — — GqV = — ( Go + uuCo)V 



since (Godx)V is the current shunted across the condenser. Inserting d/dx of equation 
(7.1a) into equation (7.2a) gives 

d 2 V dl 

-Q~2= ~{Ro + iuL 0 ) — = (Rq +iiuL 0 )(Go +iiuC 0 )V = ^ 2 V 

where 7 2 = {Rq + ix'Z.o)(Go + ix'Co), so that 7 is a complex quantity which may be 
written 



7 = a + \k 
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Inserting d/dx of equation (7.2a) into equation (7.1a) gives 
d 2 1 dV 

~q~ 2 — —{Go + iwCo) -7^ = ( Ro + i^Lo)(Go + 'uoCq)I — 7“/ 



an equation similar to that for V. 
The equation 



d 2 V 

Ih 2 



-7 2 V = 0 



(7.5) 



has solutions for the x-dependence of V of the form 

V = A e~ 7 * or V = Be + ^ x 



where A and B are constants. 

We know already that the time-dependence of VTs of the form e lut , so that the complete 
solution for V may be written 



V= (A e-^ x + Be 7X )e i0Jt 

or, since 7 = a + ik, 

V = {A e- ax e~ ikx + Be ax e +ifa ) e iwf 

gi (uit-kx) _|_ g gO.T gi(wH-fcv) 

The behaviour of V is shown in Figure 7.10 — a wave travelling to the right with an 
amplitude decaying exponentially with distance because of the term e ~ ax and a wave 
travelling to the left with an amplitude decaying exponentially with distance because of the 
term e ax . 

In the expression 7 = a + ik, 7 is called the propagation constant, a is called the 
attenuation or absorption coefficient and k is the wave number. 




Figure 7.10 VoLtage and current waves in both directions along a transmission line with resistance. 
The effect of the dissipation term is shown by the exponentially decaying wave in each direction 




186 



Waves on Transmission Lines 



The behaviour of the current wave / is exactly similar and since power is the product VI, 
the power loss with distance varies as (e -ax ) 2 ; that is, as e 2ax . 

We would expect this behaviour from our discussion of damped simple harmonic 
oscillations. When the transmission line properties are purely inductive (inertial) and 
capacitative (elastic), a pure wave equation with a sine or cosine solution will follow. The 
introduction of a resistive or loss element produces an exponential decay with distance 
along the transmission line in exactly the same way as an oscillator is damped with time. 

Such a loss mechanism, resistive, viscous, frictional or diffusive, will always result in 
energy loss from the propagating wave. These are all examples of random collision 
processes which operate in only one direction in the sense that they are thermodynamically 
irreversible. At the end of this chapter we shall discuss their effects in more detail. 

Characteristic Impedance of a Transmission Line with 
Resistance 

In a lossless line we saw that the ratio V+/I+ = Zq = ^/Lq/Cq = Zq 0, a purely resistive 
term. In what way does the introduction of the resistance into the line affect the 
characteristic impedance? 

The solution to the equation d 2 I/dx 2 = 7 2 1 may be written (for the x-dependence of /) as 

1= (A' e~ 7 * + B' e 7 *) 

so that equation (7.2a) 

dl 

— — -(Go + icuCo)V 



gives 



— 7 (A' e _7 * — B' e 7 *) = -(G 0 + uuC 0 )V 



or 



V 7 ( R o + ia;L o)( G o + i^o) I A 'q—"i x B ' c rn = V= y 
G 0 + icuCo V 7 4 

But, except for the e luJf term, 

A' e~ 7 * = /+ 

the current wave in the positive x-direction, so that 



V_ 



R 0 + icuLo 
Go + ituCo 



U = V 



+ _ R 0 + itcLo _ , 
r V Go + i^Co “ 0 



or 



V- 

T 
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for a transmission line with resistance. Similarly B ' e 7t = I and 

V- _ I Rq + iujLo _ , 

77“ “VGo + iwCo 0 

The presence of the resistance term in the complex characteristic impedance means that 
power will be lost through Joule dissipation and that energy will be absorbed from the wave 
system. 

We shall discuss this aspect in some detail in the next chapter on electromagnetic waves, 
but for the moment we shall examine absorption from a different (although equivalent) 
viewpoint. 

(Problems 7.13, 7.14) 



The Diffusion Equation and Energy Absorption in Waves 

On p. 23 of Chapter 1 we discussed quite briefly the effect of random processes. We shall 
now look at this in more detail. The wave equation 

d 2 (j) 1 d 2 (j> 

dx 2 c 2 dt 2 



is only one of a family of equations which have a double differential with respect to space 
on the left hand side. 

In three dimensions the left hand side would be of the form 



d 2 f d 2 (f) d 2 cj) 

dx 2 dy 2 dz 2 



which, in vector language, is called the divergence of the gradient or div grad and is written 

Five members of this family of equations may be written (in one dimension) as 

1. Laplace’s Equation 



2. Poisson's Equation 




(for (j){x) only) 



d 2 4> 



= constant 



(for f{x) only) 



3. Helmholtz Equation 



d 2 (f) 

Ik 2 



constant x <fi 
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4. Diffusion Equation 

d 2 4> dcf) 

-7— r - = +ve constant x — 
ox 1 at 

5. Wave Equation 

d 2 d> d 2 d> 

a? = +ve cons,ant x ee 

Laplace’s and Poisson’s equations occur very often in electrostatic field theory and are 
used to find the values of the electric field and potential at any point. We have already met 
the Helmholtz equation in this chapter as equation (7.5), where the constant was positive 
(written 7 2 ) and we have seen its behaviour when the constant is negative, for it is then 
equivalent to the equation for standing waves (p. 124). The constant in the wave equation is 
of course 1/c 2 where c is the wave velocity. Where the wave equation has an ‘acceleration’ 
or d 2 (f)/dt 2 term on the right hand side, the diffusion equation has a ‘velocity’ or (hi)/ dt term. 

All equations, however, have the same term d 2 (j)/dx 2 on the left hand side, and we must 
ask: ‘What is its physical significance?’ 

We know that the values of the scalar 4> will depend upon the point in space at which it is 
measured. Suppose we choose some point at which 4> has the value 7o and surround this 
point by a small cube of side l, over the volume of which 0 may take other values. If the 
average value of 0 over the small cube is written 7, then the difference between the average 
cj) and the value at the centre of the cube 7o is given by 



cf) — 4>o = constant x 




d 2 (f) 

Th 2 




0 



This statement is proved in the appendix at the end of this chapter and is readily understood 
by those familiar with triple integration. The left hand side of any of these equations 
therefore measures the value 



In Laplace’s equation the difference is zero, so that 4> has a constant value over the 
volume considered. Poisson's equation tells us that the difference is constant and 
Helmholtz equation states that the value of 0 at any point in the volume is proportional to 
this difference. The first two equations are ‘steady state', i.e. they do not vary with time. 

The Helmholtz equation states that if the constant is positive the behaviour of 4> with 
space grows or decays exponentially, e.g. 7 2 is positive in equation (7.5), but if the constant 
is negative, 0 will vary sinusoidally or cosinusoidally with space as the displacement varies 
with time in simple harmonic motion and the equation becomes the time independent wave 
equation for standing waves. This equation says nothing about the time behaviour of 7, 
which will depend only upon the function (f) itself. 

Both the diffusion and wave equations are time-derivative dependent. The diffusion 
equation states that the ‘velocity’ or change of $ with time at a point in the volume is 
proportional to the difference 7 — 7o, whereas the wave equation states that the 
‘acceleration’ d 2 (j)/dt 2 depends on this difference. 
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The wave equation recalls the simple harmonic oscillator, where the difference from the 
centre (x = 0) was a measure of the force or acceleration term; both the oscillator and the 
wave equation have time varying sine and cosine solutions with maximum velocity df/dt 
at the zero displacement from equilibrium; that is, where the difference f — f>o = 0. 

The diffusion equation, however, describes a different kind of behaviour. It describes a 
non-equilibrium situation which is moving towards equilibrium at a rate governed by its 
distance from equilibrium, so that it reaches equilibrium in a time which is theoretically 
infinite. Readers will have already met this situation in Newton’s Law of Cooling, where a 
hot body at temperatue Tq stands in a room of lower temperature T. The rate at which the 
body cools, i.e. the value of dT/dt, depends on T — To; a cooling graph of this experiment 
is given in Figure 7.11. The greatest rate of cooling occurs when the temperature difference 
is greatest and the process slows down as the system approaches equilibrium. Here, of 
course, T — Tq and dT/dt are both negative. 

All non-equilibrium processes of this kind are unidirectional in the sense that they are 
thermodynamically irreversible. They involve the transport of mass in diffusion, the 
transport of momentum in friction or viscosity and the transport of energy in conductivity. 
All such processes involve the loss of useful energy and the generation of entropy. 

They are all processes which are governed by random collisions, and we found in the 
first chapter, where we added vectors of constant length and random phase, that the average 
distance travelled by particles involved in these processes was proportional, not to the time, 
but to the square root of the time. 

Rewriting the diffusion equation as 

d 2 (f) 1 df> 

dx 2 d dt 




Figure 7.11 Newton's cooling curve shows that the rate of cooling of a hot body dT/dt depends on 
the temperature difference between the body and its surrounding, this difference being directly 
measured by d 2 T/dx 2 
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we see that the dimensions of the constant d , called the diffusivity, are given by 

_ 1 <t> 

length 2 d time 

so that d has the dimensions of length 2 /time. The interpretation of this as the square of 
a characteristic length varying with the square root of time has already been made in 
Chapter 1. 

In a viscous process d is given by rj/ p, where rj is the coefficient of viscosity and p is the 
density. In thermal conductivity d = Kj pC p , where K is the coefficient of thermal 
conductivity, p is the density and C p is the specific heat at constant pressure. 

A magnetic field which is non-uniformly distributed in a conductor has a diffusivity 
d = {per) ~ , where p is the permeability and a is the conductivity. 

Brownian motion is one of the best known examples of random collision processes. The 
distance x travelled in time t by a particle suffering multiple random collisions is given by 
Einstein’s diffusivity relation 

_ T 2 _ 2 RT 
t 6irriN 

The gas law, pV — RT , gives RT as the energy of a mole of such particles at temperature 
T: a mole contains N particles, where N is Avogadro’s number and RT /N = kT, the average 
energy of the individual particles, where k is Boltzmann's constant. 

The process is governed, therefore, by the ratio of the energy of the particles to the 
coefficient of viscosity, which measures the frictional force. The higher the temperature, 
the greater is the energy, the less the effect of the frictional force and the greater the 
average distance travelled. 

Wave Equation with Diffusion Effects 

In natural systems we can rarely find pure waves which propagate free from the energy-loss 
mechanisms we have been discussing, but if these losses are not too serious we can 
describe the total propagation in space and time by a combination of the wave and diffusion 
equations. 

If we try to solve the combined equation 

d 2 (f> 1 d 2 cf) 1 d 4> 

dx 2 c 2 dt 2 d dt 

we shall not obtain a pure sine or cosine solution. 

Let us try the solution 

4> = <t> m ^ u, ~T x) 



where <j> m is the maximum amplitude. This gives 



• 2 2 

l 7 
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giving a complex value for 7. But lo 2 /c 2 = k 2 , where k is the wave number, and if we put 
7 = k — ia we obtain 

7 2 = k 2 — 2 i ka — a 2 ss k 2 — i 2 ka if a k 
The solution for cf> then becomes 

(f> = </> m = <f> m 

i.e. a sine or cosine oscillation of maximum amplitude which decays exponentially with 
distance. The physical significance of the condition a <+k = 2 tt/X is that many 
wavelengths A are contained in the distance 1/a before the amplitude decays to 7 ,„e 1 
at x = 1/a. Diffusion mechanisms will cause attenuation or energy loss from the wave; the 
energy in a wave is proportional to the square of its amplitude and therefore decays as 

g —2 ax 

(Problems 7.15, 7.16, 7.17) 

Appendix 

Physical interpretation of 



d 2 4> d 2 4> t d 2 

lh 2+ ~dy 2 



d 2 



= V 2 c 



At a certain point O of the scalar field, cf> = 7o- Constructing a cube around the point O 
having sides of length / gives for the average value over the cube volume 



cj>l 3 = 



+1/ 2 



-1/2 



4> dr d_y dz 



Expanding f about the point O by a Taylor series gives 



'deb 
+ I tt; 



dx 
( d 2 cj) 
Vkh 2 



dej) 



dd> 

iy ] - V+( - 



1 

2 

f dfjA 
\dxdy ) 



x 2 + 



0 

xy ■ 



d 2 cj) 

fy 2 
2 



dz 



y 2 + 



(d 2 f\ 
\dydz ) 



yz + 



z 

0 

d 2 cj)\ 

dz 2 ) 
(d 2 f\ 

l dzdxj 



I r 

0 



zr + 



Integrating from —1 / 2 to +1/2 removes all the functions of the form 



def) 

dx 



x and 



(d 2 f\ 

\dxdy ) 0 



x y 
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whose integrals are zero, leaving, since 




x 2 dxdydz 



<^>/ 3 




_ /5 
“12 

fd 2 <j) d 2 cj) 

\Jh 2 + lh 2 




i.e. 

£-*o = ^( v V)o 

where / is a constant. 



Problem 7.1 

The figure shows the mesh representation of a transmission line of inductance Lq per unit length and 
capacitance Co per unit length. Use equations of the form 




and 

d 

L 0 dx—I r = V r - V r+l 
at 

together with the method of the final section of Chapter 4 to show that the voltage and current wave 
equations are 

d 2 V _ d 2 V 

a^- ioCo a^ 



and 



d 2 I _ d 2 I 

d^~ oCo 9r2 
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Problem 7.2 

Show that the characteristic impedance for a pair of Lecher wires of radius r and separation d in a 
medium of permeability /i and permittivity e is given by 




Problem 7.3 

In a short-circuited lossless transmission line integrate the magnetic (inductive) energy \LqI 2 and 
the electric (potential) energy * CoV 2 over the last quarter wavelength (0 to — A/4) to show that they 
are equal. 

Problem 7.4 

Show, in Problem 7.3, that the sum of the instantaneous values of the two energies over the last 
quarter wavelength is equal to the maximum value of either. 

Problem 7.5 

Show that the impedance of a real transmission line seen from a position x on the line is given by 

Ae,~ lx -Bz+i* 

Z x — Zo 

Ae-v + Be+i* 

where 7 is the propagation constant and A and B are the current amplitudes at x = 0 of the waves 
travelling in the positive and negative .v-directions respectively. If the line has a length / and is 
terminated by a load Z/,, show that 



Ae^'-Be 1 ' 
Z L — Zo 7 7 

Ae-T' + BeT' 



Problem 7.6 

Show that the input impedance of the line of Problem 7.5; that is, the impedance of the line at x = 0, 
is given by 

z _ z ( Zo sinh 7 1 + Z, cosh 7 1 
1 0 yZo cosh 7/ + Z/, sinh 7/ 

(Note : 2 cosh 7/ = e 7/ + e _7/ 

2 sinh 7/ = e jl — e _7/ ) 



Problem 7.7 

If the transmission line of Problem 7.6 is short-circuited, show that its input impedance is given by 

Z sc = Z 0 tanh 7 1 

and when it is open-circuited the input impedance is 

Zqc = Z 0 coth 7/ 

By taking the product of these quantities, suggest a method for measuring the characteristic 
impedance of the line. 
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Problem 7.8 

Show that the input impedance of a short-circuited loss-free line of lenght / is given by 




and by sketching the variation of the ratio Z,/ \J Lq/Cq with /, show that for / just greater than 
(2« + 1) A/4, Zj is capacitative, and for / just greater than n\/2 it is inductive. (This provides a 
positive or negative reactance to match another line.) 

Problem 7.9 

Show that a line of characteristic impedance Zo may be matched to a load Zl by a loss-free quarter 
wavelength line of characteristic impedance Z„, if Zf n = ZqZ l . 

(Hint — calculate the input impedance at the ZoZ„, junction.) 

Problem 7.10 

Show that a short-circuited quarter wavelength loss-free line has an infinite impedance and that if it 
is bridged across another transmission line it will not affect the fundamental wavelength but will 
short-circuit any undesirable second harmonic. 

Problem 7.11 

Show that a loss-free line of characteristic impedance Zq and length n\/2 may be used to couple two 
high frequency circuits without affecting other impedances. 

Problem 7.12 

A transmission line has Z\ = iujL and Zo = (iwC) -1 . If, for a range of frequencies ui, the phase shift 
per section /3 is very small show that (3 = k the wave number and that the phase velocity is 
independent of the frequency. 

Problem 7.13 

In a transmission line with losses where Rq/uLq and Gq/loCq are both small quantities expand the 
expression for the propagation constant 

7 = [(/?o + ia>Lo)(Go + iuCo)] 1/2 

to show that the attenuation constant 



and the wave number 



Ro jC o G o I~Lq 
T \lL^ + ^2 \C~ 0 



k — LOy/ LqCq 



LO 

V 



Show that for Go = 0 the Q value of such a line is given by k/ 2a. 
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Problem 7.14 

Expand the expression for the characteristic impedance of the transmission line of Problem 7.13 in 
terms of the characteristic impedance of a lossless line to show that if 

Ru = Go 
Lo Co 

the impedance remains real because the phase effects introduced by the series and shunt losses are 
equal but opposite. 



Problem 7.15 

The wave description of an electron of total energy E in a potential well of depth V over the region 
0 < x < l is given by Schrodinger’s time independent wave equation 



d^P 

dx 2 



S>n 2 m 

h 2 



(. E -V)ip = 0 



where m is the electron mass and h is Planck’s constant. (Note that V = 0 within the well.) 




Show that for E > V (inside the potential well) the solution for ip is a standing wave solution but for 
E < V (outside the region 0 < x < l) the x dependence of ip is e ±7;t , where 

7 = ~^-\j2m(y -E) 
n 

Problem 7.16 

A localized magnetic field H in an electrically conducting medium of permeability /r and 
conductivity o will diffuse through the medium in the x-direction at a rate given by 

d H 1 d 2 H 

dt ficr dx 2 

Show that the time of decay of the field is given approximately by L 2 fj,a, where L is the extent of 
the medium, and show that for a copper sphere of radius 1 m this time is less than 100s. 

[i (copper) = 1 • 26 x 10 ~ 6 Hm~' 
a (copper) = 5 ■ 8 x 10 7 SnT 1 

(If the earth’s core were molten iron its field would freely decay in approximately 15x 10 3 years. In 
the sun the local field would take 10 10 years to decay. When a is very high the local field will change 
only by being carried away by the movement of the medium — such a field is said to be ‘frozen’ into 
the medium — the field lines are stretched and exert a restoring force against the motion.) 
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Problem 7.17 

A point xo at the centre of a large slab of material of thermal coductivity k, specific heat C and 
density p has an infinitely high temperature T at a time to- If the heat diffuses through the medium at 
a rate given by 



dT _ k d 2 T _ d z T 
dt pC dx 2 dx 2 



show that the heat flow along the x-aixs is given by 



/(cM) 



r 







where 



a = (x — xo) and r = — 7 = 

2 v / dt 

by inserting this solution in the differential equation. The solution is a Guassian function; its 
behaviour with x and t in this problem is shown in Fig. 10.12. At (xo, f 0 ) the function is the Dirac 
delta function. The Guassian curves decay in height and widen with time as the heat spreads through 
the medium, the total heat, i.e. the area under the Gaussian curve, remaining constant. 



Summary of Important Results 

Lossless Transmission Line 

Inductance per unit length = Lq or // 

Capacitance per unit length = Co or e 

Wave Equation 



d 2 V 

W 2 

d 2 I 

dx 2 



1 d 2 V 

v 2 dt 2 



(voltage) 



1 d 2 I 

v 2 dt 2 



(current) 



Phase Velocity 



Characteristic Impedance 




LoCo 



1 

or — 

pe 




(— Zq for left-going wave) 



(for right-going wave) 
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Transmission Line with Losses 



Resistane R o per unit length 
Shunt conductance Go per unit length 
Wave equation takes form 



e iw ' 





0 (same for /) 



where 7 = a + ik is the propagation constant 

a = attenuation coefficient 
k = wave number 



giving 



y — e ~ a,: e _|_ ft qOx ^i{iot+kx) 



Characteristic Impedance 



ry, V R() + i^O ( . ^ 

z ° = 7 = Vc7^ < n s ht -s»'"s 

(— Zq for left-going wave) 



Wave Attenuation 

Energy absorption in a medium described by diffusion equation 

d 2 (j) 1 drf) 

dx 2 d dt 

Add to wave equation to account for attenuation giving 

d 2 cf> 1 d 2 4> 1 <90 

dx 2 c 2 dt 2 ^ d dt 

with exponentially decaying solution 

0 = 0 m e^“e i(w, “ fa) 
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Electromagnetic Waves 



Earlier chapters have shown that the velocity of waves through a medium is determined by 
the inertia and the elasticity of the medium. These two properties are capable of storing 
wave energy in the medium, and in the absence of energy dissipation they also determine 
the impedance presented by the medium to the waves. In addition, when there is no loss 
mechanism a pure wave equation with a sine or cosine solution will always be obtained, but 
this equation will be modified by any resistive or loss term to give an oscillatory solution 
which decays with time or distance. 

These physical processes describe exactly the propagation of electromagnetic waves 
through a medium. The magnetic inertia of the medium, as in the case of the transmission 
line, is provided by the inductive property of the medium, i.e. the permeability /j, which has 
the units of henries per metre. The elasticity or capacitive property of the medium is 
provided by the permittivity e, with units of farads per metre. The storage of magnetic 
energy arises through the permeability //; the potential or electric field energy is stored 
through the permittivity e. 

If the material is defined as a dielectric, only // and e are effective and a pure wave 
equation for both the magnetic field vector H and the electric field vector E will result. If 
the medium is a conductor, having conductivity a (the inverse of resistivity) with 
dimensions of siemens per metre or (ohms m) -1 , in addition to // and e, then some of the 
wave energy will be dissipated and absorption will take place. 

In this chapter we will consider first the propagation of electromagnetic waves in a 
medium characterized by /i and e only, and then treat the general case of a medium having 
/y , e and cr properties. 

Maxwell's Equations 

Electromagnetic waves arise whenever an electric charge changes its velocity. Electrons 
moving from a higher to a lower energy level in an atom will radiate a wave of a particular 
frequency and wavelength. A very hot ionized gas consisting of charged particles will 
radiate waves over a continuous spectrum as the paths of individual particles are curved in 
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Figure 8.1 Wavelengths and frequencies in the electromagnetic spectrum 



mutual collisions. This radiation is called ‘Bremsstrahlung’. The radiation of electro- 
magnetic waves from an aerial is due to the oscillatory motion of charges in an alternating 
current flowing in the aerial. 

Figure 8.1 shows the frequency spectrum of electromagnetic waves. All of these waves 
exhibit the same physical characteristics. 

It is quite remarkable that the whole of electromagnetic theory can be described by the 
four vector relations in Maxwell’s equations. In examining these relations in detail we shall 
see that two are steady state; that is, independent of time, and that two are time-varying. 

The two time-varying equations are mathematically sufficient to produce separate wave 
equations for the electric and magnetic field vectors, E and El, but the steady state equations 
help to identify the wave nature as transverse. 

The first time-varying equation relates the time variation of the magnetic induction, 
pH = B , with the space variation of E; that is 



d 

dt 



(pH) is connected with 



8E 

dz 



(say) 



This is nothing but a form of Lenz's or Faraday’s Law, as we shall see. 

The second time-varying equation states that the time variation of eE defines the space 
variation of H, that is 



d_ 

dt 



(eE) 



is connected with 



dH 

~di 



(say) 



Again we shall see that this is really a statement of Ampere’s Law. 

These equations show that the variations of E in time and space affect those of H and 
vice versa. E and H cannot be considered as isolated quantities but are interdependent. 
The product eE has dimensions 



farads volts charge 

x = 

metre metre area 



This charge per unit area is called the displacement charge D — eE. 

Physically it appears in a dielectric when an applied electric field polarizes the 
constituent atoms or molecules and charge moves across any plane in the dielectric which 
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/ 




Figure 8.2 In this circuit, when the switch is dosed the conduction current charges the condenser. 
Throughout charging the quantity eE in the volume of the condenser is changing and the 
displacement current per unit area d/dt (sE) is associated with the magnetic field present between 
the condenser plates 



is normal to the applied field direction. If the applied field is varying or alternating with 
time we see that the dimensions of 



dD 

~dt 




charge 
time x area 



current per unit area. This current is called the displacement current. It is comparatively 
simple to visualize this current in a dielectric where physical charges may move — it is not 
easy to associate a displacement current with free space in the absence of a material but it may 
always be expressed as I c i = e(()<p\:J ()t), where (j)n is the electric field flux through a surface. 

Consider what happens in the electric circuit of Figure 8.2 when the switch is closed and 
the battery begins to charge the condenser C to a potential V. A current I obeying Ohm's 
Law (V = IR ) will flow through the connecting leads as long as the condenser is charging 
and a compass needle or other magnetic field detector placed near the leads will show the 
presence of the magnetic field associated with that current. But suppose a magnetic field 
detector (shielded from all outside effects) is placed in the region between the condenser 
plates where no ohmic or conduction current is flowing. Would it detect a magnetic field? 
The answer is yes; all the magnetic field effects from a current exist in this region as long 
as the condenser is charging, that is, as long as the potential difference and the electric field 
between the condenser plates are changing. 

It was Maxwell’s major contribution to electromagnetic theory to assert that the 
existence of a time-changing electric field in free space gave rise to a displacement current. 
The same result follows from considering the conservation of charge. The flow of charge 
into any small volume in space must equal that flowing out. If the volume includes the top 
plate of the condenser the ohmic current through the leads produces the flow into the 
volume, while the displacement current represents the flow out. 

In future, therefore, two different kinds of current will have to be considered: 

1. The familar conduction current obeying Ohm’s Law (V = IR) and 

2. The displacement current of density OD/dt. 




202 



Electromagnetic Waves 



In a medium of permeability p and permittivity e, but where the conductivity a = 0 , the 
displacement current will be the only current flowing . In this case a pure wave equation for 
E and H will follow and there will be no energy loss or attenuation. 

When (T^Oa resistive element allows the conduction current to flow, energy loss will 
follow, a diffusion term is added to the wave equation and the wave amplitude will 
attenuate exponentially with distance. We shall see that the relative magnitude of these two 
currents is frequency-dependent and that their ratio governs whether the medium behaves 
as a conductor or as a dielectric. 



Electromagnetic Waves in a Medium having Finite Permeability 
H and Permittivity s but with Conductivity a = 0 

We shall consider a system of plane waves and choose the plane xy as that region over 
which the wave properties are constant. These properties will not vary with respect to x and 
y and all derivatives d/dx and d/dy will be zero. 

The first time-varying equation of Maxwell is written in vector notation as 



, „ „ „ <9B <9H 

curl E = VxE — — — = — p — - 
at at 



This represents three component equations: 



d 

dt' 

d 



d 

dy ' 
d 






^dt Hy dz Ex dx Ez 



d_ 

dt' 



d 



d 

dz 

d_ 

dx 

d 



dx Ey dy Ex 



( 8 . 1 ) 



where the subscripts represent the component directions. E x , E y and E z are, respectively, the 
magnitudes of E x E y and E z . Similarly, H x , H y and H z are the magnitudes of H x H y and H z . 
The dimensions of these equations may be written 

pH _ E 
time length 

and multiplying each side by (length ) 2 gives 



pH 

time 



x area = E x length 



i.e. 

total magnetic flux 

= volts 

time 

This is dimensionally of the form of Lenz’s or Faraday’s Law. 
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The second time-varying equation of Maxwell is written in vector notation as 

, TT „ TT dV dE 

curl H = V x H = = e — 

at at 

This represents three component equations: 



d d d 

£ Jt Ex = dy Hz ~dz Hy 
d d d 

£ dt Ey ^dz Hx ~dx Hz 
d _ d Tt d TT 

£ dt Ez = dx Hy ~dy Hx 



The dimensions of these equations may be written 



current I 



length 



and multiplying both sides by a length gives 

current I 
length length 

which is dimensionally of the form of Ampere’s Law (i.e. the circular magnetic field at 
radius r due to the current I flowing in a straight wire is given by H = I /2nr). Maxwell's 
first steady state equation may be written 



div D = V ■ D — e 



dE x dE y dE z 
dx dy dz 



where e is constant and p is the charge density. This states that over a small volume element 
dr dy dz of charge density p the change of displacement depends upon the value of p. 
When p = 0 the equation becomes 



3E X dEy dE~ 

— -d H 1 

dx dy dz 



(8.3a) 



so that if the displacement D = eE is graphically represented by flux lines which must 
begin and end on electric charges, the number of flux lines entering the volume element dr 
dy dz must equal the number leaving it. 

The second steady state equation is written 



„ (dH x dH y dH,\ . 

div B = V ■ B = J — — + — — + — - = 0 
\ dx dy dz ) 
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Again this states that an equal number of magnetic induction lines enter and leave the 
volume dr dv dz. This is a physical consequence of the non-existence of isolated magnetic 
poles, i.e. a single north pole or south pole. 

Whereas the charge density p in equation (8.3) can be positive, i.e. a source of flux lines 
(or displacement), or negative, i.e. a sink of flux lines (or displacement), no separate source 
or sink of magnetic induction can exist in isolation, every source being matched by a sink 
of equal strength. 



The Wave Equation for Electromagnetic Waves 

Since, with these plane waves, all derivatives with respect to x and y are zero, equations 
(8.1) and (8.4) give 



-M 



djh 

dt 



= 0 



and 



dH z 

dz 



= 0 



therefore, H z is constant in space and time and because we are considering only the 
oscillatory nature of H a constant H- can have no effect on the wave motion. We can 
therefore put H z = 0. A similar consideration of equations (8.2) and (8.3a) leads to the 
result that E z = 0. 

The absence of variation in H z and E, means that the oscillations or variations in H and 
E occur in directions perpendicular to the z-direction. We shall see that this leads to the 
conclusion that electromagnetic waves are transverse waves. 

In addition to having plane waves we shall simplify our picture by considering only 
plane-polarized waves. 

We can choose the electric field vibration to be in either the x or y direction. Let us 
consider E x only, with E y = 0. In this case equations (8.1) give 

(8.1a) 



(8.2a) 



it follows by taking d/dt of equation (8.1a) and d/d z of equation (8.2a) that 



dHy _ dE x 
~^~dt~ ~ ~dz 

and equations (8.2) give 

dE x _ dH y 
dt dz 

Using the fact that 

d 2 _ d 2 
dzdt dtdz 



d 2 

fa2 Hy = /i£ H y 



d 2 
dt 2 ' 



(the wave equation for H y ) 
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Similarly, by taking d/d t of (8.2a) and d/dz of (8.1a), we obtain 

d 2 d 2 

— ~^E X = ue-—rE x (the wave equation for E x ) 
dz 1 dt- 

Thus, the vectors E x and H y both obey the same wave equation, propagating in the 
--direction with the same velocity v 2 = 1 /fie. In free space the velocity is that of light, that 
is, c 2 = I / /./()£ o, where m, is the permeability of free space and sq is the permittivity of 
free space. 

The solutions to these wave equations may be written, for plane waves, as 

2tt 

E x = E 0 sin — ( vt—z ) 

2tt 

Hy = H 0 sin — (vt — z) 

A 

where E {) and H {) are the maximum amplitude values of E and H. Note that the sine (or 
cosine) solutions means that no attenuation occurs: only displacement currents are involved 
and there are no conductive or ohmic currents. 

We can represent the electromagnetic wave (E x , H y ) travelling in the "-direction in 
Figure 8.3, and recall that because E z and H- are constant (or zero) the electromagnetic 
wave is a transverse wave. 

The direction of propagation of the waves will always be in the ExH direction; in this 
case, ExH has magnitude, E x H y and is in the z-di recti on. 

This product has the dimensions 

voltage x current electrical power 
length x length area 

measured in units of watts per square metre. 




Figure 8.3 In a plane-polarized electromagnetic wave the electric fieLd vector E x and magnetic 
field vector H y are perpendicular to each other and vary sinusoidally. In a non-conducting medium 
they are in phase. The vector product, ExH, gives the direction of energy flow 
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The vector product, ExH gives the direction of energy flow. The energy flow per second 
across unit area is given by the Poynting vector: 

1 

E x H* 

2 



(Problem 8.1) 



Illustration of Poynting Vector 

We can illustrate the flow of electromagnetic energy in terms of the Poynting vector by 
considering the simple circuit of Figure 8.4, where the parallel plate condenser of area A 
and separation d, containing a dielectric of permittivity e, is being charged to a voltage V. 

Throughout the charging process current flows, and the electric and magnetic field 
vectors show that the Poynting vector is always directed into the volume Ad occupied by 
the dielectric. 

The capacitance C of the condenser is eA/d and the total energy of the condenser at 
potential Vis ' CV 2 joules, which is stored as electrostatic energy. But V = Ed, where E is 
the final value of the electric field, so that the total energy 

5 c ' ,2 = Kt ) £: ‘ ,2 = 5 <e£2) " 

where Ad is the volume of the condenser. 

The electrostatic energy per unit volume stored in the condenser is therefore \eE 2 and 
results from the flow of electromagnetic energy during charging. 




ExH directed to 
condenser axis 



Figure 8.4 During charging the vector ExH is directed into the condenser volume. At the end of 
the charging the energy is totalLy electrostatic and equals the product of the condenser volume. Ad, 
and the electrostatic energy per unit volume, \sE 2 
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Impedance of a Dielectric to Electromagnetic Waves 

If we put the solutions 

E x = E q sin 7^ (vt - z) 

A 

and 



in equation (8.1a) where 



2tt 

H y = Ho sin — (vt — z) 
A 



dHy _ dE x 

fJ '~dT ~ ~dz 



then 



that is 



—pvH y — —E x , and since v 2 
yffiHy = y/iE x 



1 

pe 



Hy 




£o 

Ho 



which has the dimensions of ohms. 

The value \fpfe therefore represents the characteristic impedance of the medium to 
electromagnetic waves (compare this with the equivalent result V/I = \J~LoJCq = Z (] for 
the transmission line of the previous chapter). 

In free space 

= 376.7 

so that free space presents an impedance of 376.7 0 to electromagnetic waves travelling 
through it. 

It follows from 




and therefore 

eE\ = pH) 

Both of these quantities have the dimensions of energy per unit volume, for instance eE 
has dimensions 

farads volts 2 joules 

x i = r 

metre metre s ^ metres J 




>< to 
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as we saw in the illustration of the Poynting vector. Thus, for a dielectric the electrostatic 
energy \ zE 2 per unit volume in an electromagnetic wave equals the magnetic energy per 
unit volume \fiH 2 and the total energy is the sum \zE 2 -\- i /iH 2 . 

This gives the instantaneous value of the energy per unit volume and we know that, in 
the wave. 



E x — Eq sin (27 t/A )(vt — z ) 

and 

H y — H 0 sin (2ir/\)(vt - z) 

so that the time average value of the energy per unit volume is 

leEl + ^Hl = \eEl + \vHl 
— \ eE} ) Jm~ 3 

Now the amount of energy in an electromagnetic wave which crosses unit area in unit 
time is called the intensity, /, of the wave and is evidently {\eE^)v where v is the velocity 
of the wave. 

This gives the time averaged value of the Poynting vector and, for an electromagnetic 
wave in free space we have 

I = \ceoEq = ^c/ioHfi W m~ 2 

(Problems 8.2, 8.3, 8.4, 8.5, 8.6, 8.7, 8.8, 8.9, 8.10, 8.11) 

Electromagnetic Waves in a Medium of Properties n, £ and a 
(where a ^ 0) 

From a physical point of view the electric vector in electromagnetic waves plays a much more 
significant role than the magnetic vector, e.g. most optical effects are associated with the 
electric vector. We shall therefore concentrate our discussion on the electric held behaviour. 
In a medium of conductivity o = 0 we have obtained the wave equation 

d 2 E x d 2 E x 



where the right hand term, rewritten 



l 1 



d 

dt 



|(^> 



shows that we are considering a term 






d_ 

dt 



displacement current 



area 
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When a 0 we must also consider the conduction currents which flow. These currents are 
given by Ohm’s Law as / = V/R, and we define the current density; that is, the current per 
unit area, as 




Area 



1 V 

X 

R x Length Length 



c tE 



where a is the conductivity \/(R x Length) and E is the electric field. J = crE is another 
form of Ohm’s Law. 

With both displacement and conduction currents flowing, Maxwell’s second time- 
varying equation reads, in vector form, 

VxH = |-D + J (8.5) 



each term on the right hand side having dimensions of current per unit area. The presence 
of the conduction current modifies the wave equation by adding a second term of the same 
form to its righthand side, namely 

d /currentx . . d d 

'‘»(^r) wh ‘ ch,s '‘a (,)= '‘» ( ‘ TE) 



The final equation is therefore given by 



d 2 d 2 d 

d? Ex = ^d^ Ex + ^Jt Ex 



( 8 . 6 ) 



and this equation may be derived formally by writing the component equation of (8.5) as 

dE t 



dt 



together with 



dHy 

- + crE x = 

dz 


(8.5a) 


dH y dE x 
M dt ~ dz 


(8.1a) 



and taking d/dt of (8.5a) and d/dz of (8.1a). We see immediately that the presence of the 
resistive or dissipation term, which allows conduction currents to flow, will add a diffusion 
term of the type discussed in the last chapter to the pure wave equation. The product 
(per) 1 is called the magnetic diffusivity, and has the dimensions L 1 !' 1 , as we expect of 
all diffusion coefficients. 

We are now going to look for the behaviour of E x in this new equation, with the 
assumption that its time-variation is simple harmonic, so that E x = E oe lu L Using this 
value in equation (8.6) gives 



d 2 E x 
dz 2 



(i u>pa — ui 2 pe)E x = 0 
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which is in the form of equation (7.5), written 



d 2 E x 
dz 2 



te x = o 



where 7 2 = \ujjio — 

We saw in Chapter 7 that this produced a solution with the term e _7z or e +7z , but we 
concentrate on the E x oscillation in the positive z-di recti on by writing 

£ v = £ 0 e“e“ 7z 

In order to assign a suitable value to 7 we must go back to equation (8.6) and consider the 
relative magnitudes of the two right hand side terms. If the medium is a dielectric, only 
displacement currents will flow. When the medium is a conductor, the ohmic currents of 
the second term on the right hand side will be dominant. The ratio of the magnitudes of the 
conduction current density to the displacement current density is the ratio of the two right 
hand side terms. This ratio is 

J <jE x oE x <jE x (7 

dD/dt d/dt(eE x ) d/dt(sEoe luJt ) i weE x it oe 

We see immediately from the presence of i that the phase of the displacement current is 
90° ahead of that of the ohmic or conduction current. It is also 90° ahead of the electric 
field E x so the displacement current dissipates no power. 

For a conductor, where J dD/dt, we have a c oe, and 7 2 = icr(tu^) — t oe{u>ix) 

becomes 



7 



2 



i(Jt 0[l 



to a high order of accuracy. 
Now 



Vi = 



1 + 1 

Vi 



so that 



7 = 




1/2 



and 



E x = £ 0 e iwr e _7z 

= £0 e “ 1/2 z g i [<•*- (w/jct/2) 1/2 :] 
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Figure 8.5 Electromagnetic waves in a dielectric strike the plane surface of a conductor, and the 
electric field vector E 0 is damped to a value f 0 e -1 in a distance of ( 2 /uipa ) 1//2 , the 'skin depth'. This 
explains the electrical shielding properties of a conductor. A c is the wavelength in the conductor 

a progressive wave in the positive "-direction with an amplitude decaying with the factor 

g-(wr<cr /2 )^ 2 z 

Note that the product ujpo has dimensions L 2 . 

(Problem 8.12) 

Skin Depth 

After travelling a distance 

*=(-r 

\uipcr J 

in the conductor the electric field vector has decayed to a value E x = Eq e~ J ; this distance 
is called the skin depth (Figure 8.5). 

For copper, with pap, 0 and cr = 5.8 x 10 7 S m^ 1 at a frequency of 60 Hz, S « 9 mm; 
at 1 MHz, 6 « 6.6 x 1 0 5 m and at 30 000 MHz (radar wavelength of 1 cm), 
^ w 3.8 x 10 -7 m. 

Thus, high frequency electromagnetic waves propagate only a very small distance in a 
conductor. The electric field is confined to a very small region at the surface; significant 
currents will flow only at the surface and the resistance of the conductor therefore increases 
with frequency. We see also why a conductor can act to ‘shield’ a region from electro- 
magnetic waves. 



Electromagnetic Wave Velocity in a Conductor and Anomalous 
Dispersion 

The phase velocity of the wave v is given by 

LO U C f 2uA 1/ ~ 

V — — — — - = LUO = = v\ c 

k (ujpcr/2 ) 1/2 V W 
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When 6 is small, v is small, and the refractive index c/v of a conductor can be very large. 

We shall see later that this can explain the high optical reflectivities of good conductors. 

The velocity v — oj 6 = 2irv6, so that A, in the conductor is 2tt6 and can be very small. 

Since v is a function of the frequency an electrical conductor is a dispersive medium to 

electromagnetic waves. Moreover, as the table below shows us, dv/dX is negative, so that 

the conductor is anomalously dispersive and the group velocity is greater than the wave 

velocity. Since c 2 /v 2 = /./e///o£o = /<,-£ r , where the subscript r defines non-dimensional 

relative values; that is, /r/^to = Hr, e/eo = then for / x r « 1 

2 2 
£,-V = C 

and 

d 2 dv 

d\ £ ' v £r d\ 

which confirms our statement in the chapter on group velocity that for de r /d\ positive a 
medium is anomalously dispersive. We see too that c 2 /v 2 = e r = n 2 , where n is the 

refractive index, so that the curve in Figure 3.9 showing the reactive behaviour of the 

oscillator impedance at displacement resonance is also showing the behaviour of n. This 
relative value of the permittivity is, of course, familiarly known as the dielectric constant 
when the frequency is low. This identity is lost at higher frequencies because the 
permittivity is frequency-dependent. 

Note that A f = 2n6 is very small, and that when an electromagnetic wave strikes a 
conducting surface the electric field vector will drop to about 1% of its surface value in a 
distance equal to |A C = 4.6 6. Effectively, therefore, the electromagnetic wave travels less 
than one wavelength into the conductor. 



Frequency 


free space 


s 

(m) 


^conductor — 

(m/s) 


Refractive 

index 

(c/ V conductor) 


60 


5000 km 


9x 10~ 3 


3.2 


9.5xl0 7 


10 6 


300 m 


6.6 x 10 -5 


4.1 x 10 2 


7.3x 10 5 


3x10 10 


a 

Cd 

1 

o 


3.9x 10 -7 


7.1 x 10 4 


4.2x 10 3 



(Problems 8.13, 8.14, 8.15) 

When is a Medium a Conductor or a Dielectric? 

We have already seen that in any medium having gL£ and a properties the magnitude of the 
ratio of the conduction current density to the displacement current density 

J o 

dD/dt u>£ 



a non-dimensional quantity. 
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conduction 
current o E 



Conductivity 



(7 



Ft 



Figure 8.6 A simple circuit showing the response of a conducting medium to an electromagnetic 
wave. The total current density J is divided by the parallel circuit in the ratio a/we (the ratio of the 
conduction current density to the displacement current density). A large conductance a (small 
resistance) gives a large conduction current while a small capacitative reactance 1/uje allows a large 
displacement current to flow. For a conductor o/w£> 100; for a dielectric uie/o> 100. Note the 
frequency dependence of this ratio. At t u « 10 20 rad/s copper is a dielectric to X-rays 



We may therefore represent the medium by the simple circuit in Figure 8.6 where the 
total current is divided between the two branches, a capacitative branch of reactance 1/cue 
(ohms-metres) and a resistive branch of conductance a (siemens/metre). If a is large the 
resistivity is small, and most of the current flows through the o branch and is conductive. If 
the capacitative reactance 1/tue is so small that it takes most of the current, this current is 
the displacement current and the medium behaves as a dielectric. 

Quite arbitrarily we say that if 



J <7 
dD/dt cue > 



100 



then conduction currents dominate and the medium is a conductor. If 



dD/dt cue 
J = ~<j > 



100 



then displacement currents dominate and the material behaves as a dielectric. Between 
these values exist a range of quasi-conductors; some of the semi-conductors fall into this 
category. 

The ratio cr/cue is, however, frequency dependent, and a conductor at one frequency may 
be a dielectric at another. 

For copper, which has o = 5.8 x 10 7 S m _1 and e « £q— 9 x 10~ 12 F m -1 , 



10 18 



cue frequency 
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so up to a frequency of 10 16 Hz (the frequency of ultraviolet light) a /cue > 100, and 
copper is a conductor. At a frequency of 10 20 Hz, however (the frequency of X-rays), 
cue/ a > 100, and copper behaves as a dielectric. This explains why X-rays travel distances 
equivalent to many wavelengths in copper. 

Typically, an insulator has a sa 10~ 15 S m -1 and e ~ 10~ U F m _1 , which gives 

— » 10 4 cu 
cr 

so the conduction current is negligible at all frequencies. 



Why will an Electromagnetic Wave not Propagate into a 
Conductor? 

To answer this question we need only consider the simple circuit where a condenser C 
discharges through a resistance R. The voltage equation gives 



and since I = dq/dt, we have 

d<7 

~dt RC 



C 



IR = 0 



.-t/RC 



where q o is the initial charge. 

We see that an electric field will exist between the plates of the condenser only for a time 
t ~ RC and will disappear when the charge has had time to distribute itself uniformly 
throughout the circuit. An electric field can only exist in the presence of a non-uniform 
charge distribution. 

If we take a slab of any medium and place a charge of density q at a point within the slab, 
the medium will behave as an RC circuit and the equation 



becomes 



d = d oe 



-t/RC 



q = do 



-cr/we 



do e 



-at/e 




The charge will distribute itself uniformly in a time t ~ e/a, and the electric field will be 
maintained for that time only. The time e/a is called the relaxation time of the medium 
{RC time of the electrical circuit) and it is a measure of the maximum time for which an 
electric field can be maintained before the charge distribution becomes uniform. 

Any electric field of a frequency v, where \/v — t > e/ a, will not be maintained; only a 
high frequency field where \/v — t < e/a will establish itself. 
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Impedance of a Conducting Medium to Electromagnetic Waves 

The impedance of a lossless medium is a real quantity. For the transmission line of Chapter 
7 the characteristic impedance 






Ci 



z » = 77 = 

for an electromagnetic wave in a dielectric 

E x 

z = »r 

with E x and H y in phase. 

We saw in the case of the transmission line that when the loss mechanisms of a series 
resistance R o and a shunt conductance Go were introduced the impedance became the 
complex quantity 



Z = 



R< 



iu)Lo 



Go + ituCo 



We now ask what will be the impedance of a conducting medium of properties p, e and a to 
electromagnetic waves? If the ratio of E x to H y is a complex quantity, it implies that a 
phase difference exists between the two field vectors. 

We have already seen that in a conductor 

E x = Eq e iu ‘ e“ 7z 

where 7 = (1 + i) (uipa/2) l ^ 2 , and we shall now write H y = H 0 e 1 ^ - ^ e~ 7z , suggesting 
that H y lags E x by a phase angle <i>. This gives the impedance of the conductor as 



Equation (8.1a) gives 



so that 



Z, 



Hy H 0 



dE x 

dz 



= -fi 



dHy 

dt 



and 



-7 E.x = -lojpHy 



E x it op i(tu^) i( 1 — i) /2c op 

Hy 7 (1 +i)(ujpa/2) l/2 (1 + i)(l - i) V 



1/2 



(1 + i) /2c vp\ 1/? _ 1 + i /t ap\ */ 2 



2 \ cr J 



V2 Vex 



= (d£\ m ( ‘ +i T^ ) = 

V\/2 y/2 J 




216 



Electromagnetic Waves 



a vector of magnitude (lu/jl/ct) 



1/2 



and phase angle </> 



45°. Thus the magnitude 



E o /u;/4\ V 2 

c = ~tU) = vV/ 



and H y lags E x by 45°. 

We can also express Z e by 



Z c = R + iX 






1/2 



and also write it 



Zc 



1 + i /U2/i\ '/2 

1/T w 



/Ho£oj^ue i(/) 
£0 £ fiQ & 



of magnitude 



vv 

At a wavelength A = 1 0 1 m, i.e. at a frequency v = 3000 MHz, the value of loe/cf for 
copper is 2.9 x 1 0 9 and /j, r « e r « 1. This gives a magnitude Z c = 0.02 0 at this 
frequency; for cr = oo, Z c — 0, and the electric field vector E x vanishes, so we can say that 
when Z c is small or zero the conductor behaves as a short circuit to the electric field. This 
sets up large conduction currents and the magnetic energy is increased. 

In a dielectric, the impedance 





led to the equivalence of the electric and magnetic field energy densities; that is, 
\[iEl^ = In a conductor, the magnitude of the impedance 



Z r = 



E, 

H v 



UJgL\ t / 2 
O . 



so that the ratio of the magnetic to the electric field energy density in the wave is 

\liHy _ M cr _ cr 

£ 02/4 0 )£ 

We already know that this ratio is very large for a conductor for it is the ratio of 
conduction to displacement currents, so that in a conductor the magnetic field energy 
dominates the electric field energy and increases as the electric field energy decreases. 
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Reflection and Transmission of Electromagnetic Waves at a 
Boundary 

Normal Incidence 

An infinite plane boundary separates two media of impedances Z\ and Z 2 (real or complex) 
in Figure 8.7. 

The electromagnetic wave normal to the boundary has the components shown where 
subscripts i, r and t denote incident, reflected and transmitted, respectively. Note that the 
vector direction (E r x H r ) must be opposite to that of (E;X H i) to satisfy the energy flow 
condition of the Poynting vector. 

The boundary conditions, from electromagnetic theory, are that the components of the 
field vectors E and H tangential or parallel to the boundary are continuous across the 
boundary. 

Thus 



E j + E r — E t 



and 



//j ; - h , 



where 



E 1 




Et 



Z 2 




Figure 8.7 Reflection and transmission of an electromagnetic wave incident normally on a plane 
between media of impedances Z\ andZ 2 . The Poynting vector of the reflected wave (Ex H) r shows 
that either E or H may be reversed in phase, depending on the relative magnitudes of Z\ and Z 2 
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From these relations it is easy to show that the amplitude reflection coefficient 

E r Z 2 — Z\ 

R = — = — 

E\ Z 2 + Zi 

and the amplitude transmission coefficient 

T _ E t _ 2 Z 2 

E i Z 2 + Zi 

in agreement with the reflection and transmission coefficients we have found for the 
acoustic pressure p (Chapter 6) and voltage V (Chapter 7). If the wave is travelling in air 
and strikes a perfect conductor of Z 2 = 0 at normal incidence then 

Ex Z 2 Z i _ ^ 

E i Z 2 + Z] 

giving complete reflection and 

E t 2 Zi 

— = — = 0 

E\ Z 2 + Z] 

Thus, good conductors are very good reflectors of electromagnetic waves, e.g. lightwaves 
are well reflected from metal surfaces. (See Summary on p. 550.) 



Oblique Incidence and Fresnel's Equations for Dielectrics 

When the incident wave is oblique and not normal to the infinite boundary of Figure 8.7 we 
may still use the boundary conditions of the preceding section for these apply to the 
tangential components of E and H at the boundary and remain valid. 

In Figure 8.8(a) H is perpendicular to the plane of the paper with tangential components 
El{, H r and H, but the tangential components of E become 

E, cos 6 , Ex cos 6 and E t cos </>, respectively. 

In Figure 8.8(b) E is perpendicular to the plane of the paper with tangential components 
E\, Ex and E t but the tangential components of H become H ; cos 6, H r cos 6 and H t cos f>. 

Using these components in the expressions for the reflextion and transmission 
coefficients we have, for Figure 8.8(a) 

E x cos 6 E t cos i fi/Ht — £; cos 9/Hi 

Ei cos 9 Et cos 4>/H t + E, cos 9/Hi 

so 

Ex Z 2 cos f — Z\ cos 9 

^ Ei Z 2 cos (f> + Z\ cos 9 

where R || is the reflection coefficient amplitude when E lies in the plane of incidence. 
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Figure 8.8 Incident, reflected and transmitted components of a plane polarized electromagnetic 
wave at oblique incidence to the plane boundary separating media of impedances Zi and Z 2 . The 
electric vector lies in the plane of incidence in (a) and is perpendicular to the plane of incidence in (b) 



For the transmission coefficient in Figure 8.8(a) 

E t cos f 2 E t cos f/H t 

E, cos 9 E\ cos 9 /H\ + E t cos (j)/H t 



so 



E t 2 Z 2 cos 0 

^ E\ Z\ cos 9 + Z 2 cos f 

A similar procedure for Figure 8.8(b) where E is perpendicular to the plane of incidence 
yields 



Zi cos 9 — Z\ cos f 
Zi cos 9 + Z 1 cos f 



and 



T ± 



2 Z 2 cos 9 

Z 2 cos 9 + Z 1 cos f 



Now the relation between the refractive index n of the dielectric and its impedance Z is 
given by 

Z(free space) 

Z (dielectric) 



c 

n — - — 
v 



To£o 



= ve r = 




220 



Electromagnetic Waves 



where 

P _ , 

= Mr ~ 1 

MO 

Hence we have 



Zi 722 sin 9 

Z 2 /2 1 sin (f) 

from Snell’s Law and we may write the reflection and transmission amplitude coef- 
ficients as 

4 sin 4> cos 9 
sin 2cf) + sin 26 
2 sin (f> cos 6 
sin ((f) + 9) 

In this form the expressions for the coefficients are known as Fresnel's Equations. 
They are plotted in Figure 8.9 for ni/tii = 1.5 and they contain several significant 
features. 

When 6 is very small and incidence approaches the normal we have 0 — > 0 and 0 — > 0 
so that 



tan ((/) ~ 9) 
tan ((f) + 6) ’ 
sin (cf) — 6) 
sin ((f) + 6)’ 



sin ((f) — 9) ~ tan ((f) — 9) ~ ((f) — 0) 

and 









(^~g) 

(<^+g) 



1 

»2 



1 

«1 



1 

»2 



1 

«1 



77 i - 72 2 
77 1 + 72 2 



Thus, the reflected intensity 



R 



2 

6 -* 0 




~ 0.4 



f n\ - 77 2 \ 2 
\77 1 + 77 2/ 

at an air-glass interface. 



We note also that when tan (6 + (f>) = oo and 6 + (f> = 90° then R =0. 

In this case only is finite and the reflected light is completely plane polarized with the 
electric vector perpendicular to the plane of incidence. This condition defines the value of 
the Brewster or polarizing angle 6 b for, when 6 and (f> are complementary cos 6 b = sin <b so 

77 1 sin 9 b = 722 sin (f> — 722 cos 9 b 

and 



tan 9 b = 722/72 1 



which, for air to glass defines 9 B = 56°. 
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Figure 8.9 Amplitude coefficient R and T of reflection and transmission for n 2 /n 1 = 1.5. R y and T y 
refer to the case when the electric field vector E lies in the plane of incidence. R and T± apply when 
E is perpendicular to the plane of incidence. The Brewster angle 9g defines 9 + <j> = 90° when R \ = 0 
and the reflected Light is polarized with the E vector perpendicular to the plane of incidence. /? 
changes sign (phase) at 9g. When 9<9g, tan (</> — 9) is negative for n 2 /rii = 1.5. When 
9 + (j> > 90°, tan (f + 9) is also negative 



A typical modern laboratory use of the Brewster angle is the production of linearly 
polarized light from a He-Ne laser. If the window at the end of the laser tube is tilted so that 
the angle of incidence for the emerging light is 0 B an d R =0, then the light with its 
electric vector parallel to the plane of incidence is totally transmitted while some of the 
light with transverse polarization (R ) is reflected back into the laser off-axis. If the light 
makes multiple transits along the length of the tube before it emerges the transmitted beam 
is strongly polarized in one plane. 

More general but less precise uses involve the partial polarization of light reflected from 
wet road and other shiny surfaces where refractive indices are in the range n = 1.3 — 1.6. 
Polarized windscreens and spectacles are effective in reducing the glare from such 
reflections. 
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Reflection from a Conductor (Normal Incidence) 

For Z 2 a conductor and Z\ free space, the refractive index 

n = Zi = P 
Z 2 a + ia 



is complex, where 



P = 




and 



a = 



V 2 

\ 2 ov 



A complex refractive index must always be interpreted in terms of absorption because a 
complex impedance is determined by a complex propagation constant, e.g. here Z 2 — 
icu/r/ 7 , so that 









1/2 



where 



(mmq) 1/2 _ , 

E 

The ratio E r /E, is therefore complex (there is a phase difference between the incident and 
reflected vectors) with a value 

E r Z 2 — Z\ a + ia — (3 l — f3/a + i 

E\ Z 2 T Z\ a + lex + /3 1 T (3 / ex T i 



where /3/cx^> 1 . 

Since E v /E{ is complex, the value of the reflected intensity 7 r = (E r /Ei) 2 is found 
by taking the ratio the squares of the moduli of the numerator and the denominator, so 
that 



/r 



\E r \ 2 _ \Z 2 -Z l \ 2 P/a) 2 + l 

\Ei\ 2 |Z 2 +Z !| 2 (1 + p/ a) 2 + 1 



4 /3/a 

2 + 2/3/ ex + (/?/ ex) 



4a 

T 



(for (3/a > 1) 



= 1 - 




Electromagnetic Waves in a Plasma 



223 



so that 



/ r = 1 — 4 




1/2 





For copper a = 6 x I0 7 (ohmm ') and (2u>£o/<j ) 1/2 « 0.01 at infra-red frequencies. The 
emission from an electric heater at 10 3 K has a peak at A % 2.5 x 10 6 m. A metal reflector 
behind the heater filament reflects rs 97% of these infra-red rays with 3% entering the 
metal to be lost as Joule heating between the metal surface and the skin depth, (see 
Problem 8.20) 



(Problems 8.16, 8.17, 8.18, 8.19, 8.20, 8.21, 8.22, 8.23, 8.24) 



Electromagnetic Waves in a Plasma 

We saw in Problem 1.4 that when an electron in an atom or, quantum mechanically the 
charge centre of an electron cloud, moves a small distance from its equilibrium position, 
the charge separation creates an electric field which acts as a linear restoring force and the 
resulting motion is simple harmonic with an angular frequency tuo- For a hydrogen atom 

loq ~ 4.5 x 10 16 rads -1 

When a steady electric field is applied to a dielectric, the resulting charge separation 
between an electron and the rest of its atom induces a polarization field of magnitude 



£o 

where P defines the dipole moment per unit volume. Here, n e is the electron number 
density, x is the displacement from equilibrium and £q is the permittivity of free space. 
The value of P per unit electric field is called the susceptibility 

n e ex 

X = 7T 

£qE 

and the permittivity of the dielectric is given by 

£ =£ 0 (1 + X ) 

The relative permittivity or dielectric constant 

er _±. (1 + x) _( 1 + ^) ( S „ 

A steady electric field E defines a static susceptibility. An alternating electric field E defines 
a dynamic susceptibility in which case the relative permittivity. 

£ r = n 2 

where n is the refractive index of the medium. 
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There may be resistive or damping effects to the electric field within the medium and it is 
here that our discussion of the forced damped oscillator on p. 66 becomes significant (see 
Figure 3.9). 

If the electric field is that of an electromagnetic wave of angular frequency u> we have 
E = £’()<? '^ and the value of x in equatin (8.7) is that given by equation (3.2) on p. 67 
representing curve (a) in Figure 3.9 where Fq is now the force Ee acting on each electron. 

Equation (8.7) now becomes 



£ r — 1 + X— 1 + 



e e 2 m e (u>l - u> 2 ) 



£o[m 2 {ujQ — co 2 )~ + io 2 r 2 } 



where m t is the electron mass, ujq is its harmonic frequency within the atom, u> is the 
electromagnetic wave frequency and r is the damping constant. 

This is the solution given to problem 3.10. 

Note that for 



U) C tUo 



E r 



i + 



n & e 2 
£o m e u>l 



( 8 . 8 ) 



and for 



tv » 0J0 

n e e 2 (8-9) 

£ r ~ 1 o 

£o >n e u 

The factor n e e 2 /eom e in the second term of e r has a particular significance if the material 
is not a solid but an ionized gas called a plasma. Such a gas consists of ions and electrons of 
equal number densities n j = n e with charges of opposite signs ±e and masses m , and m e 
where m, 'v>> m e . Relative displacements between ions and electrons set up a restoring 
electric field which returns the electrons to equilibrium. The relatively heavy ions are 
considered as stationary. The result in Figure 8.10 shows a sheet of negative charge —n e ex 



Plasma 

n i = n i 



F nex 
£o 



-nex 




+nex 



Figure 8.10 In an ionized gas with equal number densities of ions and electrons (/7 = n e ) and 
ui j m e , relative displacements between ions and electrons form thin sheaths of charge ±nex, 
which generate an electric field E = nex/s 0 acting on each eLectron. The motion of each electron is 
simple harmonic with an electron plasma frequency tu p where u 2 = n e e z / e 0 m e rad s" 1 
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per unit area on one side of the plasma slab with the stationary ions producing a sheet of 
positive charge +n e ex on the other side (where n-, = n e ). 

This charge separation generates an electric held E in the plasma of magnitude 

„ n e ex 

E = 

£o 



which produces an electric force —n e e 2 x/eo acting on each electron in the direction of its 
equilibrium position. 

The equation of motion of each electron is therefore 



n e e 2 x 

m e x H 

£o 



= 0 



and the electron motion is simple harmonic with an angular frequency cu p where 

2 n e e 2 

UJ = 

p £0 rn e 



The angular frequency cu p is called the electron plasma frequency and plays a significant 
role in the propagation of electromagnetic waves in the plasma. 

In the expression for the refractive index 

lu 2 

e r = n~ w H — p (8.8) 

lo 0 



n is real for all values of lj and waves of that frequency will propagate. However, when 



£ r = n 




waves will propagate only when uj > cu p 

When u>1/uj 2 >1 



n 



2 




2u2 



c 2 k 







is negative and the wave number k is considered to be complex with 

k = k' — ia. 



( 8 . 9 ) 



In this case, electromagnetic waves incident on the plasma will be attenuated within the 
plasma, or if a is large enough, will be reflected at the plasma surface. 

The electric field of the wave £ = £oe‘^“ fc ' becomes E = Eoe~ az e 1 ^' - ^) and is 
reduced to Eoe 1 when z— 1/a = 6 the penetration depth. When a 'Jp k\ the penetration 
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is extremely small and 



so that 



and 



When 



—a = 




cc 




to C to p, 6 « c/to p 




Figure 8.11 The pinch effect. A plasma is formed when a Large electrical current I is discharged 
along the axis of a cylindrical tube of gas. The azimuthal magnetic field lines compress the plasma 
and when the conductivity of the plasma is very high the penetration of the field lines into the 
plasma is very small 
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On a laboratory scale number densities of the order n e ~ 10 6 - 1 0 10 m 3 are produced 
with electron plasma frequencies in the range w p «6x 10 1() -6 x 10 12 rad s _1 , several 
orders below that of visible light. 

For these values of u> p , electromagnetic waves have a penetration depth 

6 « — w 5 x 10 3 - 5 x 10- 5 m 

LOp 

The analysis above provides an experimental method of measuring the electron number 
density of a plasma using electromagnetic waves as a probe. The angular frequency of the 
transmitted wave is varied until propagation no longer occurs and a reflected wave is 
detected. 

The rejection of magnetic fields by a plasma is exploited in laboratory experiments on 
controlled thermonuclear fusion. In these a strong magnetic induction B is used as the 
confining mechanism to keep the plasma from the walls of its containing vessel. The 
magnetic energy per unit volume is given by B 2 / 2 [i and this has the dimensions of a 
pressure which opposes and often exceeds that of the hot ionized gas. 

The well-known ‘pinch effect’. Figure 8.11, results when a large current is discharged 
along the axis of gas contained in a cylindrical tube. The current ionizes the gas and its 
azimuthal field compresses the plasma in the radial direction towards the axis, increasing 
its temperature even further. Typical magnitudes in such an experiment are T 10 8 Kand 
n e ~ 10 21 m 3 . This corresponds to a pressure of ~ 14 atmospheres which requires a 
discharge current ~ 1 0 3 A’ A where R m is the radius of the cylinder. 
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The simple expression 




(8.9) 



for the index of refraction of a plasma is modified by the presence of an external static 
magnetic field. This situation exists in the ionosphere which consists of bands of low 
density ionized gas approximately 300 km above the earth and located within the earth’s 
dipole field of magnetic induction By. 

A charged particle of velocity v in such a field experiences an electric field E^vxBo 
and when v is in the plane perpendicular to Bo it rotates around the field line with an 
angular frequency uj = eB^/m, where e is the particle charge and m is its mass. This is most 
easily seen by considering the force mv 2 /r in a circular orbit balancing the electric force 
eE = e ■ v x B 0 . 

From mv 2 jr = evBy 
we have 




where / is the frequency of precession or the number of orbits per second made by the 
particle. 
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Figure 8.12 Charged particles of velocity v perpendicular to a magnetic field line B are bound to 
the field line and orbit around it due to the Lorentz force e(v x B). The radius L of the orbit, the 
Larmor radius, is given by L = mv/eB and the orbital Larmor frequency is cu b = eB/m rad s" 1 



Figure 8.12 shows the direction of motion for positive and negative charges around a 
magnetic field line which points upwards out of the paper. 

We consider the simplest case of electromagnetic wave propagation along the direction 
Bo and assume that 



• The amplitude of electron motion is small. 

• The value of n e is low enough to neglect collisional damping. 

• The magnetic induction B () >> the magnetic induction of the electromagnetic wave. 



If we consider the electric field to be that of a circularly polarized transverse 
electromagnetic wave, then we may write E = E(r \ + ir 2 ), where ri and r 2 are 
orthogonal (mutually perpendicular) unit vectors and Bo is along the r 3 direction. 

The equation of motion for an electron of velocity v is given by 

dv 

m — = Ee' rf + ct x Bo 
dr 

If we take the steady state electron velocity to be of the form 

v = u(r i + ir 2 ) e' ut 

we find that 



m( cu ± cub) 

satisfies the equation of motion 

This means that the electron precessing around Bo with an angular frequency cub is 
driven by a rotating electric field of effective frequency cu ± cub depending on the sign of 
the circular polarization. 

Due to the electronic motion there is a current density in the plasma given by 



-m e e- 

J = n e e\ = — ; — : -E. 



w(cu ± CUb) 
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In Maxwell's equation 

VxH = |-D + J 
at 



(8.5) 



we may write, in the absence of J, D = eoE but the presence of J will modify this and the 
right hand side of equation (8.5) becomes 



d_ 

dt 



D + J 



= f* Fc iux in e e 2 
dt 0 m(w ± cu B ) 



E 




E = itueE 



giving 



— = e r = n ± = 1 - 

£o \ 






U){ljJ ± (U B ) 



We see that the ionosphere is birefringent with two different values of the refractive 
index, n + for the right handed circularly polarized wave and n _ for the left handed incident 
polarization. These waves propagate at different velocities and their reception by the 
ionosphere will depend on their polarization. In its lower D layer the ionosphere has an 
electron number density n e < 10 9 m 3 with w p ~ 10 6 rads _1 and for the upper F 2 layer, 
« e < 1 0 1 2 m 3 with « 1() 7 rads *. Taking the value of the earth’s magnetic field as 
3 x 10“ 5 T: that is (0.3 G) gives an electron precession frequency w B ~ 6 x 10 6 rads _1 . 

Figure 8.13 shows the behaviour of n 2 + and n 1 versus uj/ujy, give for the hxed value of 
Wp/wB = 2. Other values of uj p /uj B give curves of a similar shape. In the wide frequency 
intervals where n 2 , and n 2 have opposite signs (positive or negative), one state of the 
circular polarization cannot propagate in the plasma and will be reflected when it strikes 
the ionosphere. The other wave will be partially transmitted. So, when a linearly polarized 
wave with uj < uj b in Figure 8.14 is incident on the ionosphere, the reflected wave will be 
elliptically polarized. The electron number densities in the ionosphere are measured by 
varying the frequency uj of the transmitted electromagnetic waves until reflection occurs. 
This method is similar to that used on the laboratory plasmas of the previous section. 
However, the value of « e varies in an ionospheric layer. It is found to increase with height 
until it reaches a maximum, only to fall off rapidly with a further increase in height. The 
height for a particular value of n e is measured by timing the interval between the trans- 
mitted and reflected wave. 

The analysis above explains the main features of radio reception which are: 

• Very high frequencies (VHF) are received over relatively short distances only. 

• Medium wave (MW) reception is possible over longer distances and improves at night. 

• Short wave (SW) reception is possible over very long distances. 
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Figure 8.13 The ionospheric plasma is birefringent to electromagnetic waves with different values 
of the refractive index n + for right handed circularly polarized waves and n _ for left handed 
circularly polarized waves. These values depend upon the ratio of the plasma frequency w p to the 
Larmor frequency wr. Graphs of n 2 + and n 2 _ are shown for a fixed value uj p /lob = 2 with a horizontal 
axis u/wb, where t o is the frequency of the propagating e.m. wave 



Very high frequencies are greater than uj p for both the D and F 2 layers; the waves 
propagate through both layers without reflection (Figure 8.15). The D layer has a plasma 
frequency ~300 kHz; that is, a wavelength of ~ 1 km and medium waves with 200 < 
A < 600 km are attenuated within it. However, the electron number density in the D 
layer, sustained by ionizing radiation during the day, drops very sharply after sunset and 
the medium waves are transmitted to the higher F 2 layer where they are reflected 
and received over longer distances. The D layer is transparent to short waves, 10 < 
A < 80 m, but these are reflected by the layer F 2 allowing long-distance radio reception 
around the earth. 
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(a) 

Figure 8.14 (a) The number density n e of a plasma (in this case the ionosphere) may be measured 

by a probing electromagnetic wave, the frequency of which is varied until reflection occurs. The time 
of the wave from transmission to reception is a measure of the height at which reflection occurs. The 
variation of number density n e with height h in an ionospheric Layer is shown in (b) 



VHF = Very high frequency T- Transmitter 

HW = Medium waves R - Receiver 

SW = Short waves 




Figure 8.15 Electron number densities in the ionosphere layers D and F 2 govern the pattern of 
radio reception. Very high frequencies (VHF) penetrate both Layers and are received only over short 
distances Medium waves (MW) are reflected at the D Layer during the daytime but are received over 
longer distances at night when n e of the D layer drops and medium waves proceed to the F 2 layer 
before reflection. Short waves (SW) penetrate the D layer to be reflected at the F 2 Layer and are 
received over very long distances 
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Problem 8.1 

The solutions to the e.m. wave equations are given in Figure 8.3 as 

2n 

E x = E 0 sin — (vt - z) 

A 



and 



Hy = H 0 sin — (vt - z) 

A 

Use equations (8.1a) and (8.2a) to prove that they have the same wavelength and phase as shown in 
figure. 

Problem 8.2 

Show that the concept of S 2 /2/i (magnetic energy per unit volume) as a magnetic pressure accounts 
for the fact that two parallel wires carrying currents in the same direction are forced together and that 
reversing one current will force them apart. (Consider a point midway between the two wires.) Show 
that it also explains the motion of a conductor carrying a current which is situated in a steady 
externally applied magnetic field. 



Problem 8.3 

At a distance r from a charge e on a particle of mass in the electric field value is E = e/Aneor 2 . 
Show by integrating the electrostatic energy density over the spherical volume of radius a to infinity 
and equating it to the value me 2 that the ‘classical" radius of the electron is given by 



a = 1.41 x 10 15 m 



Problem 8.4 

The rate of generation of heat in a long cylindrical wire carrying a current I is I 2 R, where R is the 
resistance of the wire. Show that this joule heating can be described in terms of the flow of energy 
into the wire from surrounding space and is equal to the product of the Poynting vector and the 
surface area of the wire. 



Problem 8.5 

Show that when a current is increasing in a long uniformly wound solenoid of coil radius r the total 
inward energy flow rate over a length / (the Poynting vector times the surface area 2nd) gives the 
time rate of change of the magnetic energy stored in that length of the solenoid. 



Problem 8.6 

The plane polarized electromagnetic wave (E x , H y ) of this chapter travels in free space. Show that its 
Poynting vector (energy flow in watts per squaremetre) is given by 



s = E x Hy = c(\e 0 E 2 x + i fi 0 H 2 ) = ce 0 E 2 x 
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where c is the velocity of light. The intensity in such a wave is given by 

/ = 5 av = cs 0 E 2 = iceo-Emax 

Show that 

S = 1.327 x 10 -3 E^ ax 
E max = 27.45 S 1/2 Vm~' 

//max = 7.3 x 10 -2 S 1/2 Am -1 



Problem 8.7 

A light pulse from a ruby laser consists of a linearly polarized wave train of constant amplitude 
lasting for 10 ~ 4 s and carrying energy of 0.3 J. The diameter of the circular cross section of the 
beam is 5x 10 “ 3 m. Use the results of Problem 8.6 to calculate the energy density in the beam to 
show that the root mean square value of the electric field in the wave is 

2.4 x 10 5 Vm^ 1 



Problem 8.8 

One square metre of the earth’s surface is illuminated by the sun at normal incidence by an energy 
flux of 1.35 kW. Show that the amplitude of the electric field at the earth’s surface is 1010 V m _1 
and that the associated magnetic field in the wave has an amplitude of 2.7 A m _1 (See Problem 8.6). 
The electric field energy density 2 sE 2 has the dimensions of a pressure. Calculate the radiation 
pressure of sunlight upon the earth. 



Problem 8.9 

If the total power lost by the sun is equal to the power received per unit area of the earth’s surface 
multiplied by the surface area of a sphere of radius equal to the earth sun distance (15xl0 7 km), 
show that the mass per second converted to radiant energy and lost by the sun is 4.2 x 10 9 kg. (See 
Problem 8.6.) 

Problem 8.10 

A radio station radiates an average power of 10 5 W uniformly over a hemisphere concentric with the 
station. Find the magnitude of the Poynting vector and the amplitude of the electric and magnetic 
fields of the plane electromagnetic wave at a point 10 km from the station. (See Problem 8.6) 

Problem 8.11 

A plane polarized electromagnetic wave propagates along a transmission line consisting of two 
parallel strips of a perfect conductor containing a medium of permeability p and permittivity e. A 
section of one cubic metre in the figure shows the appropriate field vectors. The electric field E x 
generates equal but opposite surface charges on the conductors of magnitude eE x Cm 2 . The motion 
of these surface charges in the direction of wave propagation gives rise to a surface current (as in the 
discussion associated with Figure 7.1). Show that the magnitude of this current is H y and that the 
characteristic impedance of the transmission line is 

Ex _ fp 

H y ~ V £ 
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Problem 8.12 

Show that equation (8.6) is dimensionally of the form (per unit area) 



where V is a voltage, L is an inductance and 7 is a current. 

Problem 8.13 

Show that when a group of electromagnetic waves of nearly equal frequencies propagates in a 
conducting medium the group velocity is twice the wave velocity. 

Problem 8.14 

A medium has a conductivity u = 10 _1 S m _I and a relative permittivity e r = 50, which is constant 
with frequency. If the relative permeability g, r = 1, is the medium a conductor or a dielectric at a 
frequency of (a) 50 kHz, and (b) 10 4 MHz? 

[eo = (36-7T x 10 9 ) -1 Fm _1 ; fio = 4n x 10~ 7 Hm~'] 



Answer: (a) o/uie = 720 (conductor) 

(b) a/c oe = 3.6 x 10 (dielectric). 

Problem 8.15 

The electrical properties of the Atlantic Ocean are given by 

e r = 81, yU r =l, cr = 4.3Sm _1 

Show that it is a conductor up to a frequency of about 10 MHz. What is the longest electromagnetic 
wavelength you would expect to propagate under water? 
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Problem 8.16 

Show that when a plane electromagnetic wave travelling in air is reflected normally from a plane 
conducting surface the transmitted magnetic field value H, w 2 H t , and that a magnetic standing 
wave exists in air with a very large standing wave ratio. If the wave is travelling in a conductor and is 
reflected normally from a plane conductor-air interface, show that E t « 2Ej. Show that these two 
cases are respectively analogous to a short-circuited and an open-circuited transmission line. 

Problem 8.17 

Show that in a conductor the average value of the Poynting vector is given by 

Sm = \E q Hq cos 45° 

= \Hq x (real part of Z c ) W m 2 

where Eq and Hq are the peak field values. A plane 1000 MHz wave travelling in air with Eq = 
1 V m _1 is incident normally on a large copper sheet. Show firstly that the real part of the conductor 
impedance is 8.2xlO -3 f2 and then (remembering from Problem 8.16 that H o doubles in the 
conductor) show that the average power absorbed by the copper per square metre is 1.6xl0~ 7 W. 

Problem 8.18 

For a good conductor s r = fi r = 1. Show that when an electromagnetic wave is reflected normally 
from such a conducting surface its fractional loss of energy (1 -reflection coefficient I r ) is 
~ ^/8 lje/ a. Note that the ratio of the displacement current density to the conduction current density 
is therefore a direct measure of the reflectivity of the surface. 

Problem 8.19 

Using the value of the Poynting vector in the conductor from Problem 8.17, show that the ratio of 
this value to the value of the Poynting vector in air is ~ y^8 lus/ct, as expected from Problem 8.18. 

Problem 8.20 

The electromagnetic wave of Problems 8.18 and 8.19 has electric and magnetic field magnitudes in 
the conductor given by 

£ t = Ae- b e i( “'- fe > 

and 



H 



y 




1/2 

e -kz pi(ut-kz) e -iir/4 



where k = (w/icr/2) *^ 2 . 

Show that the average value of the Poynting vector in the conductor is given by 



S 



av 




1/2 

e -2fe (W m 2 ) 



This is the power absorbed per unit area by the conductor. We know, however, that the wave 
propagates only a distance of the order of the skin depth, so that this power is rapidly transformed. 
The rate at which it changes with distance is given by dS^/dz, which gives the energy transformed 
per unit volume in unit time. Show that this quantity is equal to the conductivity o times the square 
of the mean value of the electric field vector E, that is, the joule heating from currents flowing in the 
surface of the conductor down to a depth of the order of the skin depth. 
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Problem 8.21 

Show that when light travelling in free space is normally incident on the surface of a dielectric of 
refractive index n the reflected intensity 



and the transmitted intensity 



(Note 7 r + / t = 1.) 

Problem 8.22 

Show that if the medium of Problem 8.21 is glass ( n = 1.5) then 7 r = 4% and 7 t = 96%. If an 
electromagnetic wave of 100 MHz is normally incident on water (e r = 81) show that 7 r = 65% and 
7, = 35%. 




Z;7t t 2 _ 4 n 

Z{E\ ( 1 + n) 2 



Problem 8.23 

Light passes normally through a glass plate suffering only one air to glass and one glass to air 
reflection. What is the loss of intensity? 



Problem 8.24 

A radiating antenna in simplified form is just a length xq of wire in which an oscillating current is 
maintained. The expression for the radiating power is that used on p. 47 for an oscillating electron 

p _ dE _ q 2 u A x 2 Q 
dr 127 T£oc 3 



where q is the electron charge and to is the oscillation frequency. The current 7 in the antenna may be 
written 7 q = u>q. If P = \RIq show that the radiation resistance of the antenna is given by 



where A is the radiated wavelength (an expression valid for A 3> Xo)- 

If the antenna is 30 m long and transmits at a frequency of 5 x10 s H with a root mean square 
current of 20 A, show that its radiation resistance is 1 .97 H and that the power radiated is 400 W. 
(Verify that A Xq.) 



Summary of Important Results 

Dielectric; p and e(a = 0) 

Wave equation 

d 2 E x _ d 2 E x 
dz 2 ^ £ dt 2 

d 2 H y _ d 2 H y 
dz 1 dt 2 
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Impedance 



Hy 




(376.70 for free space) 



Energy density ^ eE 2 + ^ pH 2 



Mean energy flow 



Intensity = S = u(mean energy density) 

v {\ eEl x+h* H y) average 



VeE x =\ VeE l( 



max) 



Conductor; p e and a 

Add diffusion equation to wave equation for loss effects from a 



d 2 E 2 x _ 
dz 2 



d 2 E 2 



pe 



dt 2 



+ per 



dE x 

dt 



giving 



where 



Skin Depth 



E x = £' 0 e“ fc e i ( uj; - fe ) 



k 2 — uipcr/l 



6 = - giving E x = E 0 e 1 
k 



Criterion for conductor/dielectric behaviour is ratio 



conduction current a 

= — (note frequency dependence) 

displacement current uje 



Impedance Z c (conductor) 



_ 1 + i fujp\ 1/2 

Zt “ ~En W 



with magnitude Z c = 376.6^ p r /e r y/uie/cr ohms 
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c Z (free space) 

v Z (dielectric) 



Electromagnetic Waves in a Plasma 

Low frequency waves propagate, but a high frequency wave E oe“ is attenuated or 
reflected when u < co p the plasma frequency, where u > 2 = n e e 2 /eo>n e . (n c is the electron 
number density.) 

The plasma has a refractive index n, where 




when tu p luq, the wave amplitude Eq — »• E oe 1 in a skin depth distance 





1/2 
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Plane Wave Representation in Two and Three Dimensions 

Figure 9. 1 shows that in two dimensions waves of velocity c may be represented by lines of 
constant phase propagating in a direction k which is normal to each line, where the 
magnitude of k is the wave number k = 2 tt/X. 

The direction cosines of k are given by 

l — — / n = — where k 2 = k 2 , + k 2 

k k 

and any point r(x, y) on the line of constant phase satisfies the equation 

lx + my = p = ct 

where p is the perpendicular distance from the line to the origin. The displacements at all 
points r(x, y) on a given line are in phase and the phase difference cf) between the origin and 
a given line is 

2tt 2ir 

cj) = — (path difference) = — p = \i-r = k\x + kyy 
A A 

= kp 

Hence, the bracket (cot — <f>) = (cot — kx) used in a one dimensional wave is replaced by 
(cut — k r) in waves of more than one dimension, e.g. we shall use the exponential 
expression 

gi(w7— k-r) 

In three dimensions all points r(x, y, z) in a given wavefront will lie on planes of constant 
phase satisfying the equation 

lx + my + nz — p — ct 
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Figure 9.1 Crests and troughs of a two-dimensional plane wave propagating in a general direction 
k (direction cosines / and m). The wave is specified by lx + my = p = ct, where p is its perpendicular 
distance from the origin, travelled in a time t at a velocity c 



where the vector k which is normal to the plane and in the direction of propagation has 
direction cosines 




(so that k 2 = k\ + k 2 + k\) and the perpendicular distance p is given by 

kp = k ■ r = k\x + k 2 y + k 2 z 



Wave Equation in Two Dimensions 

We shall consider waves propagating on a stretched plane membrane of negligible 
thickness having a mass p per unit area and stretched under a uniform tension T. This 
means that if a line of unit length is drawn in the surface of the membrane, then the 
material on one side of this line exerts a force T (per unit length) on the material on the 
other side in a direction perpendicular to that of the line. 

If the equilibrium position of the membrane is the xy plane the vibration displacements 
perpendicular to this plane will be given by z where z depends on the position x, y. In 
Figure 9.2a where the small rectangular element ABCD of sides 6x and 6y is vibrating, 
forces T6x and Tby are shown acting on the sides in directions which tend to restore the 
element to its equilibrium position. 

In deriving the equation for waves on a string we saw that the tension T along a curved 
element of string of length d.r produced a force perpendicular to x of 



T 



d 2 y 

dx 2 



dx 
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T8x 




x 



(a) (b) 

Figure 9.2 Rectangular element of a uniform membrane vibrating in the z-direction subject to one 
restoring force, T8x, along its sides of length by and another, Tby, along its sides of length bx 



where y was the perpendicular displacement. Here in Figure 9.2b by exactly similar 
arguments we see that a force T 8y acting on a membrane element of length bx produces a 
force 

d 2 z 

where z is the perpendicular displacement, whilst another force T bx acting on a membrane 
element of length by produces a force 

d 2 z 

T6x w Jy 



The sum of these restoring forces which act in the z-direction is equal to the mass of the 
element pbxby times its perpendicular acceleration in the z-direction, so that 

d 2 z d 2 z d 2 y 

T-^ i Sx6y + T g ? 6xSy = r6x6y 7jif 

giving the wave equation in two dimensions as 

d 2 z d 2 z _ p d 2 z _ 1 d 2 z 
dx 2 dy 2 T dt 2 c 2 dt 2 



where 



s-1 



The displacement of waves propagating on this membrane will be given by 



z — a e i ( wf_k ' r ) — A 



where 



k 2 = k\ + k\ 
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The reader should verify that this expression for z is indeed a solution to the two- 
dimensional wave equation when u> = ck. 

(Problem 9.1) 

Wave Guides 

Reflection of a 2D Wave at Rigid Boundaries 

Let us first consider a 2D wave propagating in a vector direction k(ki,k 2 ) in the xy plane 
along a membrane of width b stretched under a tension T between two long rigid rods 
which present an infinite impedance to the wave. 

We see from Figure 9.3 that upon reflection from the line y — b the component k\ 
remains unaffected whilst ki is reversed to — fci- Reflection at y — 0 leaves k \ unaffected 
whilst —ki is reversed to its original value /o. The wave system on the membrane will 
therefore be given by the superposition of the incident and reflected waves; that is, by 



z — j\ j & i[^-{kix+k 2 y)} _(_ ^ e i[ut-(k i x-k 2 y)] 



subject to the boundary conditions that 

Z = 0 at y — 0 and y = b 

the positions of the frame of infinite impedance. 

The condition z = 0 at y = 0 requires 

A-2 = ~A\ 



and z — 0 at y — b gives 



sin ^ 2 ^ = 0 



infinite 




Figure 9.3 Propagation of a two-dimensional wave along a stretched membrane with infinite 
impedances aty = 0 and y = b giving reversal of k 2 at each reflection 
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nir 

k 2 = — 



(Problem 9.2) 

With these values of Ai and ki the displacement of the wave system is given by the real 
part of z, i.e. 

z = + 2 A i sin kiy sin (cot — k \x) 

which represents a wave travelling along the x direction with a phase velocity 



u) ( k 



v * = vr\M v 



where v, the velocity on an infinitely wide membrane, is given by 



because 



v = — which is < v v 
k 



k 2 = k\+ k\ 



k\ = ( k 2 — 



k 2 = k j + - 



2 7 t 2 \ 1/2 /a ; 2 n 2 tt 2 \ 1/2 



and the group velocity for the wave in the x direction 



duj k\ 2 (k A 

Vg = dTi = ^j v = It) v 



giving the product 



Since k\ must be real for the wave to propagate we have, from 



k 2 = k 2 - 



k z = ^> 



2 2 2 
UJ H 7 T 



2 jj2 



the condition that 
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that is 



or 



LO > 



mtv 

b 



nv 

v> — , 
“ 2b' 



where n defines the mode number in the y direction. Thus, only waves of certain 
frequencies v are allowed to propagate along the membrane which acts as a wave guide. 

There is a cut-off frequency mtv /b for each mode of number n and the wave guide acts as 
a frequency filter (recall the discussion on similar behaviour in wave propagation on the 
loaded string in Chapter 4). The presence of the sin /^y term in the expression for the 
displacement z shows that the amplitude varies across the transverse y direction as shown in 
Figure 9.4 for the mode values n = 1,2,3. Thus, along any direction in which the waves 
meet rigid boundaries a standing wave system will be set up analogous to that on a string of 
fixed length and we shall discuss the implication of this in the section on normal modes and 
the method of separation of variables. 

Wave guides are used for all wave systems, particularly in those with acoustical and 
electromagnetic applications. Fibre optics is based on wave guide principles, but the major 
use of wave guides has been with electromagnetic waves in telecommunications. Here the 
reflecting surfaces are the sides of a copper tube of circular or rectangular cross section. 
Note that in this case the free space velocity becomes the velocity of light 




the phase velocity, but the relation v p v g = c 2 ensures that energy in the wave always 
travels with a group velocity v g < c. 




Figure 9.4 Variation of amplitude withy-direction for two-dimensionaL wave propagating aLong the 
membrane of Figure 9.3. Normal modes (n = 1, 2 and 3 shown) are set up along any axis bounded by 
infinite impedances 
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(Problems 9.3, 9.4, 9.5, 9.6, 9.7, 9.8, 9.9, 9.10, 9.11) 



Normal Modes and the Method of Separation of Variables 

We have just seen that when waves propagate in more than one dimension a standing wave 
system will be set up along any axis which is bounded by infinite impedances. 

In Chapter 5 we found that standing waves could exist on a string of fixed length l where 
the displacement was of the form 

. sin j , sin j 
v = A }kx }u) n t, 
cos cos 



where A is constant and where 



means that either solution may be used to 



fit the boundary conditions in space and time. When the string is fixed at both ends, the 
condition y = 0 at x = 0 removes the cos kx solution, and y — 0 at x — l requires k n l = rnr 
or k„ = nit /l = 2ir/X„, giving / = nX„/2. Since the wave velocity c = u n X n , this permits 
frequencies = 2iw n = imc/l, defined as normal modes of vibration or eigenfrequen- 
cies. 

We can obtain this solution in a way which allows us to extend the method to waves in 
more than one dimension. We have seen that the wave equation 

d 2 (j) _ 1 d 2 f 

dx 2 c 2 dt 2 

has a solution which is the product of two terms, one a function of x only and the other a 
function of t only. 

Let us write f = X (x) T ( t) and apply the method known as separation of variables. 

The wave equation then becomes 

d 2 X 1 d 2 T 
— T = ^X — 



X xx T = - XT,, 
c~ 

where the double subscript refers to double differentiation with respect to the variables. 
Dividing by f = X{x)T{t) we have 



where the left-hand side depends on x only and the right-hand side depends on t only. 
However, both x and t are independent variables and the equality between both sides can 
only be true when both sides are independent of x and t and are equal to a constant, which 
we shall take, for convenience, as —k 2 . Thus 



A' 2 , giving X xx + k 2 X = 0 
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and 



\ T ^r = ~k 2 , giving T„ + c 2 k 2 T = 0 
c z T 

X(x) is therefore of the form e ±lfcc and T(t) is of the form e ±lckt , so that cf) — A t ±lkx e, ±lckt , 
where A is constant, and we choose a particular solution in a form already familiar to us by 
writing 



<f>— A Q'( ckt ~ kx ) 

— A Q^ut-kx) ’ 



where cu = ck, or we can write 



as above. 



, . sin 1 , sin 

6= A }kx 

cos J cos 



ckt 



Two-Dimensional Case 

In extending this method to waves in two dimensions we consider the wave equation in 
the form 

d 2 (f> d 2 cf > 1 d 2 cj) 

dx 2 dy 2 c 2 dt 2 

and we write <f> — X(x)Y(y)T(t), where Y(y) is a function of y only. 

Differentiating twice and dividing by cf> = XYT gives 




1 T 



tt 



c 2 T 



where the left-hand side depends on x and y only and the right-hand side depends on t only. 
Since x, y and t are independent variables each side must be equal to a constant, — k 2 say. 
This means that the left-hand side terms in x and y differ by only a constant for all x and y, 
so that each term is itself equal to a constant. Thus we can write 



and 



^ _ ,2 Yyy _ < 2 

X — *1) Y ~ K 2 

■4£=-<d + ki) = -p 
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giving 

+ k\x = 0 

Yyy + kjY = 0 
T tt + c 2 k 2 T = 0 

or 

(j) = Ae ±ikxx t ±ik 2 y e ±ickt 

where k 2 = k 2 + k\. Typically we may write 

, . sin 1 . sin 1 , sin 1 , 

d> = A >k\x >/c?y ?ckt. 
cos J cos J cos J 

Three-Dimensional Case 

The three-dimensional treatment is merely a further extension. The wave equation is 

d 2 cf) d 2 (f) d 2 (j> 1 d 2 (j) 

dx 2 dy 2 dz 2 c 2 dt 2 

with a solution 

4> — X(x)Y(y)Z(z)T(t) 

yielding 

, 4 sin 1 , sin 1 , sin 1 , sin 1 , 

(b — A }k\x >koy fk^z >ckt , 
cos J COS J ' cos J cos J 

where k 2 + k\ + k 2 — k 2 . 

Using vector notation we may write 

^ = Ae^ w '“ kr \ where k ■ r = k\x + kyy + k^z 

Normal Modes in Two Dimensions on a Rectangular Membrane 

Suppose waves proceed in a general direction k on the rectangular membrane of sides a 
and b shown in Figure 9.5. Each dotted wave line is separated by a distance A/2 and a 
standing wave system will exist whenever a = n \ A A 1 and b = ni BB', where n i and iii are 
integers. 

But 

A A k X2 tt 1 7 r 

2cosa 2k\ 2 A k\ k\ 



AA' 
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Figure 9.5 Normal modes on a rectangular membrane in a direction k satisfying boundary 
conditions of zero displacement at the edges of length a = n-iX/Z cos a and b = n 2 A/ 2 cos/? 



so that 



n i7r n i7r 

a = — — and A' i = . 

k i rz 



Similarly 



Hence 



A 2 



/2 2 7T 

IT 



A 2 



47T 2 




or 



l = M + n l 

A \ a 2 b 2 

defining the frequency of the n i th mode on the x-axis and the n 2 th mode on the y-axis, that 
is, the (« i« 2 ) normal mode, as 



c n ? nl 2 

where c ' 



r 

p 



If k is not normal to the direction of either a or b we can write the general solution for 
the waves as 



sin 



cos 



sin 



z = A }kix }k 2 y 



cos 



sin 

cos 



ckt. 



with the boundary conditions z = 0 at x = 0 and a;z=0aty = 0 and b. 
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The condition z = 0 at x = y = 0 requires a sin k\x sin k 2 y term, and the condition z = 0 
at x = a dehnes k\ = n\ir/a. The condition z — 0 at y — b gives k 2 = nyx/b, so that 

. n 1 ttx . n 2 zry . 

z = A sin sin — — sin ckt 

a b 

The fundamental vibration is given by n 1 = 1, ni = 1, so that 



v = 



1 1 \ T 

a 2 b 2 ) 4 p 



In the general mode (« i« 2 ) zero displacement or nodal lines occur at 

a 2 a 



x = 0, — , — , . . . a 

n 1 n\ 



and 



y = 0 , 



b 2b 
n 2 ’ n 2 



Some of these normal modes are shown in Figure 9.6, where the shaded and plain areas 
have opposite displacements as shown. 




Figure 9.6 Some normal modes on a rectangular membrane with shaded and clear sections having 
opposite sinusoidal displacements as indicated 
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The complete solution for a general displacement would be the sum of individual normal 
modes, as with the simpler case of waves on a string (see the chapter on Fourier Series) 
where boundary conditions of space and time would have to be met. Several modes of 
different values (n in 2 ) may have the same frequency, e.g. in a square membrane the modes 
(4,7) (7,4) (1,8) and (8,1) all have equal frequencies. If the membrane is rectangular and 
a — 3 b, modes (3,3) and (9,1) have equal frequencies. 

These modes are then said to be degenerate, a term used in describing equal energy 
levels for electrons in an atom which are described by different quantum numbers. 



Normal Modes in Three Dimensions 

In three dimensions a normal mode is described by the numbers m,n 2 ,n 3 , with a 
frequency 




(9.1) 



where / 1 , 1 2 and 1 3 are the lengths of the sides of the rectangular enclosure. If we now form a 
rectangular lattice with the x-, y- and ’-axes marked off in units of 

c c c 

277 ’ 2/7 an 2/7 



respectively (Figure 9.7), we can consider a vector of components n \ units in the 
x-direction, n 2 units in the y-direction and nj, units in the ’-direction to have a length 





n 

7 



2 

2 

2 

3 




Figure 9.7 Lattice of rectangular ceLls in frequency space. The length of the vector joining the 
origin to any cell corner is the value of the frequency of an allowed normal mode. The vector 
direction gives the propagation direction of that particular mode 
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Each frequency may thus be represented by a line joining the origin to a point 
cn i/2/ 1, c«2/2/2> cw 3/2/3 in the rectangular lattice. 

The length of the line gives the magnitude of the frequency, and the vector direction 
gives the direction of the standing waves. 

Each point will be at the corner of a rectangular unit cell of sides c/ 2 / 1, c/ 2 / 2 and c/2/3 
with a volume c 3 / 8 / 1/2/3- There are as many cells as points (i.e. as frequencies) since each 
cell has eight points at its corners and each point serves as a corner to eight cells. 

A very important question now arises: how many normal modes (stationary states in 
quantum mechanics) can exist in the frequency range u to v + Au? 

The answer to this question is the total number of all those positive integers n 1 , «2, «3 for 
which, from equation ( 9 . 1 ), 



v < 



l 2 

l 2 




< (v + Av ) 2 



This total is the number of possible points {n\,n2,nf) lying in the positive octant 
between two concentric spheres of radii v and u + Au. The other octants will merely repeat 
the positive octant values because the n s appear as squared quantities. 

Hence the total number of possible points or cells will be 



1 (volume of spherical shell) 



8 volume of cell 
4 m / 2 Au 8/1 (2/3 



8 

= 47r/ 1/2/3 




so that the number of possible normal modes in the frequency range u to u + Av per unit 
volume of the enclosure 

4 m / 2 dz y 



Note that this result, per unit volume of the enclosure , is independent of any particular 
system; we shall consider two very important applications. 



Frequency Distribution of Energy Radiated from a Hot Body. 
Planck's Law 

The electromagnetic energy radiated from a hot body at temperature T in the small 
frequency interval u to u + Au may be written E v Au. If this quantity is measured 
experimentally over a wide range of u a curve T \ in Figure 9.8 will result. The general 
shape of the curve is independent of the temperature, but as T is increased the maximum of 
the curve increases and shifts towards a higher frequency. 

The early attempts to describe the shape of this curve were based on two results we have 
already used. 
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Figure 9.8 Planck's black body radiation curve plotted for two different temperatures T 2 > T lr 
together with the curve of the classical Rayleigh-0. 6-Jeans explanation Leading to the 'ultra-violet 
catastrophe' 

In the chapter on coupled oscillations we associated normal modes with ‘degrees of 
freedom’, the number of ways in which a system could take up energy. In kinetic theory, 
assigning an energy \kT to each degree of freedom of a monatomic gas at temperature T 
leads to the gas law pV = RT = NkT where N is Avogadro’s number, k is Boltzmann's 
constant and R is the gas constant. 

If we assume that each frequency radiated from a hot body is associated with the normal 
mode of an oscillator with two degrees of freedom and two transverse planes of 
polarization, the energy radiated per frequency interval do may be considered as the 
product of the number of normal modes or oscillators in the interval do and an energy 
contribution of kT from each oscillator for each plane of polarization. This gives 

4no 2 do 2 kT 87 xo 2 kT do 

E„ do = 5 = X 

c i 

a result known as the Rayleigh-Jeans Law. 

This, however, gives the energy density proportional to o 2 which, as the solid curve in 
Figure 9.8 shows, becomes infinite at very high frequencies, a physically absurd result 
known as the ultraviolet catastrophe. 

The correct solution to the problem was a major advance in physics. Planck had 
introduced the quantum theory, which predicted that the average energy value kT should be 
replaced by the factor ho/(e hl k kr — 1 ), where h is Planck’s constant (the unit of action) as 
shown in Problem 9.12. The experimental curve is thus accurately described by Planck's 
Radiation Law 



E v do 



8tro 2 ho 
c 3 e hv/kT _ l dU 
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(Problem 9.12) 



Debye Theory of Specific Heats 



The success of the modern theory of the specific heats of solids owes much to the work of 
Debye, who considered the thermal vibrations of atoms in a solid lattice in terms of a vast 
complex of standing waves over a great range of frequencies. This picture corresponds in 
three dimensions to the problem of atoms spaced along a one dimensional line (Chapter 5). 
In the specific heat theory each atom was allowed two transverse vibrations (perpendicular 
planes of polarization) and one longitudinal vibration. 

The number of possible modes or oscillations per unit volume in the frequency interval v 
to v + du is then given by 



d n = 47 tv" du 




(9.2) 



where cj and c / are respectively the transverse and longitudinal wave velocities. 

Problem 9.12 shows that each mode has an average energy (from Planck’s Law) of 
e = hi//(e lu '/ kr — 1) and the total energy in the frequency range v to v -\- dv for a gram 
atom of the solid of volume V A is then 



V^ed n = 4 ttV a 




hi ' 3 

eWw - 1 



The total energy per gram atom over all permitted frequencies is then 



E a = 



V A £& n = 47rl 7 a 




Vm hv 2, . 

e hv/kT _ l l ' 



where v m is the maximum frequency of the oscillations. 

There are N atoms per gram atom of the solid (N is Avogadro’s number) and each atom 
has three allowed oscillation modes, so an approximation to v m is found by writing the 
integral of equation (9.2) for a gram atom as 



dn = 3 N = 4ttV a 




•Cm 

. 0 



v 2 Av = 



4ttV a 

3 





v 



3 

m 



The values of c T and c / can be calculated from the elastic constants of the solid (see 
Chapter 6 on longitudinal waves) and v m can then be found. 

The values of E A thus becomes 



p 9 N 

v± 



hv 



> hv/kT _ ] 



r d v 



and the variation of E,\ with the temperature T is the molar specific heat of the substance at 
constant volume. The specific heat of aluminium calculated by this method is compared 
with experimental results in Figure 9.9. 




254 



Waves in More than One Dimension 




Figure 9.9 Debye theory of specific heat of soLids. Experimental, values versus theoretical curve for 
aluminium 

(Problems 9.13, 9.14, 9.15, 9.16, 9.17, 9.18, 9.19) 

Reflection and Transmission of a Three-Dimensional Wave at a 
Plane Boundary 

To illustrate such an event we choose a physical system of great significance, the passage of 
a light wave from air to glass. More generally. Figure 9.10 shows a plane polarized 
electromagnetic wave E; incident at an angle 0 to the normal of the plane boundary z — 0 
separating two dielectrics of impedance Z\ and Zi, giving reflected and transmitted rays E r 
and E t , respectively. The boundary condition requires that the tangential electric field E x is 
continuous at z — 0. The propagation direction k; of E; lies wholly in the plane of the 
paper (y = 0) but no assumptions are made about the directions of the reflected and 
transmitted waves (nor about the planes of oscillation of their electric field vectors). 

We write 



E- = A - = A ■ g'M - M^sinfl+zeosO)] 

E r = A r e i ( a<-kr ' r ) — J \ i Q i l Ut ~( k ri x + lc r2y+krSZ)] 

and 

E — A t e i(w«— k t -r) _ ^ e i[ut-(k,ix+k a y+k a z)] 

where k Y (k r i,k r 2 ,k^) and k t (fc t i,&t 2 ,&t 3 ) are respectively the reflected and transmitted 
propagation vectors. 

Since E x is continuous at - = 0 for all x, y, t we have 

A e i[oJf— *i(jc sin 0)] _j_ g i [ut-(k r ix+kfiy)\ 

— A t e i [‘‘’*-(*«i JC +*< 2 y)] 
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'LE x = 0 at z = 0 



Figure 9.10 Plane-polarized electromagnetic wave propagating in the plane of the paper is 
represented by vector E,- and is reflected as vector E r and transmitted as vector E t at a plane interface 
between media of impedances Z\ and Z 2 . No assumptions are made about the planes of propagation 
of E r and Et. From the boundary condition that the electric field component E x is continuous at the 
plane z = 0 it follows that (1) vectors E; E r and E t propagate in the same plane; (2) 0=6' (angLe of 
incidence = angle of reflection); (3) Snell's law (sin 0/ sin f) = nz/n\, where n is the refractive 
index 



an identity which is only possible if the indices of all three terms are identical; that is 

iot — k[X sin 6 = u>t — k v \x + k x yy 
= cut — k x \x + k t 2 y 



Equating the coefficients of x in this identity gives 

k\ sin# = k rl — k t \ 



whilst equal coefficients of y give 



0 — k r 2 — k t 2 



The relation 



krl — k t 2 — 0 

shows that the reflected and transmitted rays have no component in the y direction and lie 
wholly in the xz plane of incidence; that is, incident reflected and transmitted (refracted) 
rays are coplanar. 
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Now the magnitude 



ki = kl = Y~ 

Ai 



since both incident and reflected waves are travelling in medium Z i . Hence 

k x sin 8 — k r \ 

gives 



that is 



k\ sin# = k t sin#' 



8 = 8 ' 



so the angle of incidence equals the angle of reflection. 
The magnitude 

k ~ — 
t_ A 2 



so that 



gives 



or 



k{ sin 8 = k t i = k t sin ( 



27T . 27 r . 

— sin 8 = — sin < 
A[ At 



sin# A] m 

sincj) A 2 n\ 



Refractive Index (medium 2) 



Refractive Index (medium 1) 



a relationship between the angles of incidence and refraction which is well known as 
Snell’s Law. 



Total Internal Reflection and Evanescent Waves 

On p. 254 we discussed the propagation of an electromagnetic wave across the boundary 
between air and a dielectric (glass, say). We now consider the reverse process where a wave 
in the dielectric crosses the interface into air. 

Snell’s Law still holds so we have, in Figure 9.11, 

n i sin# = n 2 sin < p 

where 

n i > n 2 and n 2 /n \ = n r < 1 

Thus 



sin# = (« 2 /«i) sin^ = /i r sin^ 
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n 2 


tZ 4 > 
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k Z 

n 2 


j 


\ X / 
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e \ 
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e \ * 





(a) (b) 



n 1 > n 2 




Figure 9.11 When Light propagates from a dense to a rare medium (n i > 02 ) Snell's Law defines 
6 = 9 C as that angle of incidence for which <j> = 90° and the refracted ray is tangential to the plane 
boundary. Total internal reflection can take place but the boundary conditions still require a 
transmitted wave known as the evanescent or surface wave. It propagates in the x direction but its 
amplitude decays exponentially with z 



with cj)> 0. Eventually a critical angle of incidence 9 C is reached where <f> = 90° and 
sin 9 = n r ; for 9 > 9 C , sin# > n r . For glass to air n r = dj and 9 C = 42°. 

It is evident that for 9>9 C no electromagnetic energy is transmitted into the rarer 
medium and the incident wave is said to suffer total internal reflection. 

In the reflection coefficients R n and R± on p. 218 we may replace cos cf) by 

(1 — sin 2 (/)) */ 2 = [1 — (sin#/« r )~] 1/2 
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and rewrite 

sin 2 #) 1 ^ 2 — n 2 cos 6 
sin 2 9) 1 ^ 2 + n 2 cos# 

and 

cos 8 — (n 2 — sin 2 8) 1|/2 
cos 0 + (« 2 — sin 2 8) '/ 2 

Now for 0 > d c , sin d > n r and the bracketed quantities in R and R are negative so 
that 1 1 and R are complex quantities; that is (E r ) ,, and (E r ) ± have a phase relation which 
depends on 6. 

It is easily checked that the product of R and R* is unity so we have R R = R R = 1. 
This means, for both the examples of Figure 8.8, that the incident and reflected intensities 
/ ! and 7 r = 1 . The transmitted intensity I t = 0 so that no energy is carried across the 
boundary. 

However, if there is no transmitted wave we cannot satisfy the boundary condition 
E\ + E r = E t on p. 254, using only incident and reflected waves. We must therefore assert 
that a transmitted wave does exist but that it cannot on the average carry energy across the 
boundary. 

We now examine the implications of this assertion, using Figure 9.10 above, and we 
keep the notation of p. 254. This gives a transmitted electric field vector 

E t — A e i M _ (^ ti -' : +*t 2 )’+*t3z)] 

A gi[wf— fc t (jrsin0+zcos0)] 




because y = 0 in the xz plane, k tl = k t sin <j> and k t 3 = k t cos cj). Now 



cos (f> =1 — sin 2 0=1 — sin 2 #/n 2 

.’. k t cos 0 = ±& t (l — sin 2 0/n 2 ) 



which for 0 > 8 C gives sin 0 > n r so that 

k t cos 0 = =piA' t 



sm 2 8 \' /2 

^72 ) 



We also have 



k t sin0 = k t sin 8/n r 



so 



£ — A g=F 0Z e i{wt—k T x sin 0/ n r ) 
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The alternative factor e +/3z defines an exponential growth of A t which is physically 
untenable and we are left with a wave whose amplitude decays exponentially as it 
penetrates the less dense medium. The disturbance travels in the x direction along the 
interface and is known as a surface or evanescent wave. 

It is possible to show from the expressions for R\\ and R L on p. 258 that except at 
6 = 90° the incident and the reflected electric field components for (E) | in one case and 
(E) x in the other, do not differ in phase by 7r rad and cannot therefore cancel each other 
out. The continuity of the tangential component of E at the boundary therefore leaves a 
component parallel to the interface which propagates as the surface wave. This effect has 
been observed at optical frequencies. 

Moreover, if only a very thin air gap exists between two glass blocks it is possible for 
energy to flow across the gap allowing the wave to propagate in the second glass block. 
This process is called frustrated total internal reflection and has its quantum mechanical 
analogue in the tunnelling effect discussed on p. 431. 



Problem 9.1 

Show that 

, = ^gi {ut-(kiX+kiy)} 

where k 2 = ur 2 /c 2 = k\ + k\ is a solution of the two-dimensional wave equation 

d 2 z d 2 z _ 1 d 2 z 
dx 2 dy 2 c 2 dt 2 



Problem 9.2 

Show that if the displacement of the waves on the membrane of width b of Figure 9.3 is given by the 
superposition 



z _ a 1 e i M-(*i*+* : »')] e i[^r— (* i jc— * iy)] 

with the boundary conditions 

Z = 0 at y = 0 and y = b 



then the real part of z is 



where 



Z = +2Ai sinA^y sin (urt — k\X ) 



k 2 



rnr 

b 



Problem 9.3 

An electromagnetic wave loses negligible energy when reflected from a highly conducting surface. 
With repeated reflections it may travel along a transmission line or wave guide consisting of two 
parallel, infinitely conducting planes (separation a). If the wave 
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is plane polarized, so that only E z exists, then the propagating direction k lies wholly in the xy plane. 
The boundary conditions require that the total tangential electric field E z is zero at the conducting 
surfaces x = 0 and x = a. Show that the first boundary condition allows E z to be written E z = 
E o(e lk * x — e~' kxX ) e 1 ^ - "'), where k x = k cos# and k y = k sin# and the second boundary condition 
requires k x = mr/a. 

If Ao = 27 xc/ui, \ c = 2n/k x and X g = lix/ky are the wavelengths propagating in the x and y 
directions respectively show that 

1 11 

A? + A|'Al 

We see that for n = 1, k x = ir/a and A c = 2a, and that as u> decreases and Ao increases, k y = k sin 6 
becomes imaginary and the wave is damped. Thus, n = 2(k x = 2n /a) gives A c = a, the ‘critical 
wavelength’, i.e. the longest wavelength propagated by a waveguide of separation a. Such cut-off 
wavelengths and frequencies are a feature of wave propagation in periodic structures, transmission 
lines and wave-guides. 

Problem 9.4 

Show, from equations ( 8 . 1 ) and ( 8 . 2 ), that the magnetic field in the plane-polarized electromagnetic 
wave of Problem 9.3 has components in both x- and y-directions. [When an electromagnetic wave 
propagating in a waveguide has only transverse electric field vectors and no electric field in the 
direction of propagation it is called a transverse electric (TE) wave. Similarly a transverse magnetic 
(TM) wave may exist. The plane-polarized wave of Problem 9.3 is a transverse electric wave; the 
corresponding transverse magnetic wave would have H Z ,E X and E y components. The values of n in 
Problem 9.3 satisfying the boundary conditions are written as subscripts to define the exact mode of 
propagation, e.g. TE io-] 

Problem 9.5 

Use the value of the inductance and capacitance of a pair of plane parallel conductors of separation a 
and width b to show that the characteristic impedance of such a waveguide is given by 

where and e are respectively the permeability and permittivity of the medium between the 
conductors. 
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Problem 9.6 

Consider either the Poynting vector or the energy per unit volume of an electromagnetic wave to 
show that the power transmitted by a single positive travelling wave in the waveguide of Problem 9.5 
is \abE\^/ ej n. 



Problem 9.7 

An electromagnetic wave (E, H) propagates in the x-direction down a perfectly conducting hollow 
tube of arbitrary cross section. The tangential component of E at the conducting walls must be zero 
at all times. 

Show that the solution E = E(y,z ) ncos (cot — k x x) substituted in the wave equation yields 



d 2 E(y,z) d 2 E(y,z) 
dy 2 dz 2 



k 2 E(y,z ), 



where k 2 = u> 2 /c 2 — k 2 and k x is the wave number appropriate to the x-direction, n is the unit vector 
in any direction in the (y, z) plane. 



Problem 9.8 

If the waveguide of Problem 9.7 is of rectangular cross-section of width a in the y-direction and 
height b in the z-direction, show that the boundary conditions E x = 0 at y = 0 and a and at z = 0 and 
b in the wave equation of Problem 9.7 gives 



. miry . nnz , , 

E x = A sin sin cos (u>t — k x x), 

a b 



where 




Problem 9.9 

Show, from Problems 9.7 and 9.8, that the lowest possible value of to (the cut-off frequency) for k x to 
be real is given by m = n = 1. 

Problem 9.10 

Prove that the product of the phase and group velocity uj/k x , du:/dk x of the wave of Problems 9.7- 
9.9 is c 2 , where c is the velocity of light. 

Problem 9.11 

Consider now the extension of Problem 9.2 where the waves are reflected at the rigid edges of the 
rectangular membrane of sides length a and b as shown in the diagram. The final displacement is the 
result of the superposition 

z = A e i[ut-(kix+k 2 y)\ 

+ At e i [w<-(*a-*2r)] 

+ A 3 

+ A4 e i [ w,_ ( _tlJC+t2 ^)l 

with the boundary conditions 

z = 0 at x = 0 and x = a 
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and 



z = 0 at y = 0 and y = b 




Show that this leads to a displacement 



(the real part of z), where 



2 = — 4Ai s'mkixsinkiycosujt 



I Hl7F A I 

k\ — and k 2 

a 



n 2 7r 

b 



Problem 9.12 

In deriving the result that the average energy of an oscillator at frequency v and temperature T is 
given by 

iw 

E = e(W*r) _ i 

Planck assumed that a large number N of oscillators had energies 0, his, 2hv . . . nhv distributed 
according to Boltzmann’s Law 



N n = N Q &- nhvlkT 



where the number of oscillators N n with energy nhv decreases exponentially with increasing n. 
Use the geometric progression series 

N N„ = N 0 (l +e - hv/kT + e - 2hvlkT . . .) 

n 



to show that 



N = 



Np 

1 g —hv/kT 



If the total energy of the oscillators in the nth energy state is given by 



E„ = N„nhv 
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prove that the total energy over all the n energy states is given by 



E = E n = N 0 

n 



hue -H kT 
( 1 — e -hv/kT) 2 



Hence show that the average energy per oscillator 

E hv 

£ J\f e hv/kT _ j 

and expand the denominator to show that for hv <C kT, that is low frequencies and long wavelengths. 
Planck's Law becomes the classical expression of Rayleigh-Jeans. 



Problem 9.13 

The wave representation of a particle, e.g. an electron, in a rectangular potential well throughout 
which V = 0 is given by Schrodinger’s time-independent equation 

d 2 V dH d 2 ^ 8iv 2 m 

1 1 = L'T 

dx 2 dy 2 dz 2 h 2 



where E is the particle energy, m is the mass and h is Planck’s constant. The boundary conditions to 
be satisfied are ip = 0 at x = y = z = 0 and at x = L x , y = L y , z = L z , where L x , L y and L z are the 
dimensions of the well. 

Show that 



hrx 

'I' = A sin — 
L x 



r icy 




sin 



mrz 

L z 



is a solution of Schrodinger’s equation, giving 

E - 8m\Ll + L 2 + L 2 j 



When the potential well is cubical of side L, 

£= 8^ (/2 + r2 + M2) 

and the lowest value of the quantized energy is given by 

E = Eq for 1=1, r = n = 0 



Show that the next energy levels are 3Eq,6Eq (three-fold degenerate), 9Eq (three-fold 
degenerate), 11 £q (three-fold degenerate), 12Eq and 14-fio (six-fold degenerate). 



Problem 9.14 

Show that at low energy levels (long wavelengths) hv <C kT , Planck’s radiation law is equivalent to 
the Rayleigh-Jeans expression. 



Problem 9.15 

Planck's radiation law, expressed in terms of energy per unit range of wavelength instead of 
frequency, becomes 
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Use the variable x= ch/XkT to show that the total energy per unit volume at temperature T° 
absolute is given by 



where 



•OO 

E \ dA = aT 4 J m -3 

J o 



(The constant co/4 



_ 8tt 5 £ 4 

5^10 

a, Stefan’s Constant in the Stefan-Boltzmann Law.) Note that 

,0 ° X 3 Ck _ 7T 4 

,0 e* - 1 “ 15 



Problem 9.16 

Show that the wavelength A„, at which E\ in Problem 9.15 is a maximum is given by the solution of 

( 1 ~$ eX=l ' Wh6re X = ^f 

The solution is ch/X m kT = 4.965. 

Problem 9.17 

Given that ch/X m = 5 kT in Problem 9.16, show that if the sun’s temperature is about 6000 K, then 
A„, ~ 4.7 x 10 ~ 7 m, the green region of the visible spectrum where the human eye is most sensitive 
(evolution ?). 



Problem 9.18 

The tungsten filament of an electric light bulb has a temperature of w 2000 K. Show that in this case 
A„, w 14 x 10 _7 m, well into the infrared. Such a lamp is therefore a good heat source but an 
inefficient light source. 



Problem 9.19 

A free electron (travelling in a region of zero potential) has an energy 




2 in 




E(k) 



where the wavelength 

A = h/p = 2n/k 

In a weak periodic potential; for example, in a solid which is a good electrical conductor, 
E = (h 1 /2m*) k 2 , where m* is called the effective mass. (For valence electrons m* ~ m.) 

Represented as waves, the electrons in a cubic potential well (V = 0) of side L have allowed wave 
numbers k, where 

k 2 = k 2 + k 2 + k 2 and k t = 

x y z £ 

(see Problem 9.13). For each value of k there are two allowed states (each defining the spin state of 
the single occupying electron-Pauli's principle). Use the arguments in Chapter 9 to show that the 
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total number of states in k space between the values k and Ar + dA: is given by 



P «= 2 U —* 



L\ 3 47 ik 2 d k 



Use the expression E = (fi 2 /2m*) k 2 to convert this into the number of states S(E) dE in the energy 
interval dE to give 

A (1 m\ 3 ^ 2 r— 

sm = iP- F* 21 



where A = L 3 . 

If there are N electrons in the N lowest energy states consistent with Pauli’s principle, show that 
the integral 



Ef 

S(E) 
J o 



dE = N 



gives the Fermi energy level 



h 2 T3n 2 N\ 2/3 



2m* V A 



where E[ is the kinetic energy of the most energetic electron when the solid is in its ground state. 



Summary of Important Results 

Wave Equation in Two Dimensions 

d 2 z d 2 z _ 1 d 2 z 

dx 2 dy 2 c 2 dt 2 

Lines of constant phase Ix+my — ct propagate in direction k(k t . ki) where l — 
k\/k,m — k2/k,k 2 = k 2 + k 2 and c 2 = ut 2 /k 2 . Solution is 

z =Ae i(a *“ k ' r) for r(x, y) 

where k ■ r = k\x + kyy. 

Wave Equation in Three Dimensions 

d 2 (f) d 2 (f> d 2 cj) 1 d 2 (f) 

dx 2 dy 2 dz 2 c 2 dt 2 

Planes of constant phase lx + my + nz — ct propagate in a direction 

k(fci, k2, k$), where l = k\/k , m — k2/k, n — k^/k 
k 2 — k 2 + k 2 + k 2 and c 2 =ui 2 /k 2 . 
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Solution is 

cj) = A e 1 (“ ; “ k ' r) for r (x,y,z)- 



Wave Guides 

Reflection from walls y = 0,y = b in a two-dimensional wave guide for a wave of 
frequency u) and vector direction k(k | . kf) gives normal modes in the y direction with 
ki = nir/b and propagation in the x direction with phase velocity 



and group velocity 



U) U) 




du j 



v„ = — — such that 
8 dk i 



Cut-off frequency 

Only frequencies u > rnrv/b will propagate where n is mode number. 



Normal Modes in Three Dimensions 



Wave equation separates into three equations (one for each variable x, y, z) to give solution 



= A 



sin , sin . sin . sin 

k \x koy k^z wt 
cos cos cos cos 



(Boundary conditions determine final form of solution.) 



For waves of velocity c, the number of normal modes per unit volume of an enclosure in 
the frequency range v to v + do 

4Tro 2 Av 
c 3 

Directly applicable to 

• Planck’s Radiation Law 

• Debye’s theory of specific heats of solids 

• Fermi energy level (Problem 9.19) 





10 

Fourier Methods 



Fourier Series 

In this chapter we are going to look in more detail at the implications of the principles of 
superposition which we met at the beginning of the book when we added the two separate 
solutions of the simple harmonic motion equation. Our discussion of monochromatic 
waves has led to the idea of repetitive behaviour in a simple form. Now we consider more 
complicated forms of repetition which arise from superposition. 

Any function which repeats itself regularly over a given interval of space or time is 
called a periodic function. This may be expressed by writing it as f(x) = f(x ± a) where a 
is the interval or period. 

The simplest examples of a periodic function are sines and cosines of fixed frequency 
and wavelength, where a represents the period r, the wavelength A or the phase angle 
27rrad, according to the form of x. Most periodic functions for example the square wave 
system of Figure 10. 1 , although quite simple to visualize are more complicated to represent 
mathematically. Fortunately this can be done for almost all periodic functions of interest in 
physics using the method of Fourier Series, which states that any periodic function may be 
represented by the series 

fix) = la o + a i cosx + u? cos 2 x . . . + a n cos nx 
W 2 - ( 10 T) 

+ b i sin .v + l ?2 sin 2x . . . + b n sin nx, 



that is, a constant | a q plus sine and cosine terms of different amplitudes, having 
frequencies which increase in discrete steps. Such a series must of course, satisfy certain 
conditions, chiefly those of convergence. These convergence criteria are met for a function 
with discontinuities which are not too severe and with first and second differential 
coefficients which are well behaved. At such discontinuities, for instance in the square 
wave where /(x) = ±/z at x = 0, ±27t, etc. the series represents the mean of the values of 
the function just to the left and just to the right of the discontinuity. 
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TT 

h 

J 0 



f(x) = ^2 (sin x + -1- sin 3x + -1- sin 5x + y sin lx . . . ) 



2k 



An 



Figure 10.1 Square wave of height h and its Fourier sine series representation (odd function) 



We may write the series in several equivalent forms: 

| OO 

/(■*) = x«o + E (a i, cos nx + b n sin nx) 

2 n= l 

j OO 

= 9 a ° + X] C " COS ( HX ~ ^") 

^ n=l 



where 



and 



2 

ft 



or 



where 



and 



tan 0 n b n f ci n 

OO 

f( X )= Y, d nZ™ 

n=— oo 

2d n =a n - ib n (n > 0) 
2d n = a ~„ + i b-„(n < 0) 



To find the values of the coefficients a „ and b„ let us multiply both sides of equation 
(10.1) by cos nx and integrate with respect to x over the period 0 to 27r (say). 

Every term 



whilst every term 



*27T 

cosmxcosnxdx 
. o 



0 if m yF n 
7 r if m = n 



*27 T 

sin mx cos nx dx = 0 for all m and n. 
. o 
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Thus for m = n. 



so that 



a n 



*27 T 

cos 2 nxAx — ira n 
. o 



i r 27r 

a„=— /(x)cosnxdx 

™ Jo 



Similarly, by multiplying both sides of equation (10.1) by sin nx and integrating from 0 
to 27 t we have, since 



*27T 

sin mx sin nx dx 
. o 



0 if m 7 ^ n 
7 r if m = n 



that 



1 

b n — — f(x) sinnxdx 

k Jo 

Immediately we see that the constant (n = 0), given by \ao = \/2ir J? 77 f(x) dx, is just 
the average of the function over the interval 27r. It is, therefore, the steady or 'd.c.' level on 
which the alternating sine and cosine components of the series are superimposed, and the 
constant can be varied by moving the function with respect to the x-axis. When a periodic 
function is symmetric about the x-axis its average value, that is, its steady or d.c. base level, 
/no, is zero, as in the square wave system of Figure 10.1. If we raise the square waves so 
that they stand as pulses of height 2 h on the x-axis, the value of \ciq is 1m (average value 
over 2tt). The values of a n represent twice the average value of the product/(x) cos nx over 
the interval 2-7r; b„ can be interpreted in a similar way. 

We see also that the series representation of the function is the sum of cosine terms 
which are even functions [cosx = cos (— x)] and of sine terms which are odd functions 
[sinx = — sin (— x)]. Now every function f(x) = \ [/(x) +/(— x)] +\[f{x) — /(— x)], in 
which the first bracket is even and the second bracket is odd. Thus, the cosine part of a 
Fourier series represents the even part of the function and the sine terms represent the odd 
part of the function. Taking the argument one stage further, a function/(x) which is an even 
function is represented by a Fourier series having only cosine terms; if/(x) is odd it will 
have only sine terms in its Fourier representation. Whether a function is completely even or 
completely odd can often be determined by the position of the y-axis. Our square wave of 
Figure 10.1 is an odd function [/(x) = — /(— x)]; it has no constant and is represented by 
f(x) = Ah / 7r( sinx + 1/3 sin 3x +l/5sin5x, etc. but if we now move the y-axis a half 
period to the right as in Figure 10.2, then f{x) =/(— x), an even function, and the square 
wave is represented by 



/(*) 



Ah 

7 r 



(cosx 



j cos 3x + / cos 5x — lj cos lx + ■ ■ ■) 
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f(x) = ^7 ( C0S X - -1- cos 3x + -L COS 5x - 4" cos 7x . . . ) 
7C 3 5 7 
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Figure 10.2 The wave of Figure 10.1 is now symmetric about they axis and becomes a cosine series 
(even function) 



If we take the first three or four terms of the series representing the square wave of 
Figure 10.1 and add them together, the result is Figure 10.3. The fundamental, or first 
harmonic, has the frequency of the square wave and the higher frequencies build up the 
squareness of the wave. The highest frequencies are responsible for the sharpness of the 
vertical sides of the waves; this type of square wave is commonly used to test the frequency 
response of amplifiers. An amplifier with a square wave input effectively ‘Fourier analyses’ 
the input and responds to the individual frequency components. It then puts them together 
again at its output, and if a perfect square wave emerges from the amplifier it proves that 
the amplifier can handle the whole range of the frequency components equally well. Loss 
of sharpness at the edges of the waves shows that the amplifier response is limited at the 
higher frequency range. 




addition of first 
three terms 



Figure 10.3 Addition of the first three terms of the Fourier series for the square wave of Figure 10.1 
shows that the higher frequencies are responsible for sharpening the edges of the pulse 
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Exampie of Fourier Series 

Consider the square wave of height h in Figure 10.1. The value of the function is given by 

f(x) — h for 0 < x < 7r 



and 



f(x) = —h for tt < x <2 tt 
The coefficients of the series representation are given by 



because 



and 



1 

Q. n 


- 


*7 r 


* 27T 


h 


cosnxdx — h 


cosnxdx 


7 r 


_ 


0 


7 r 



*7 T 27T 

cosnxdx = cos«xdx = 0 

«. 0 t 7 r 



bn — 


- 


*7 r 


*27 r 


h 


sin nx dx — h 


sin nx dx 


7T 


. 


0 


7T 




= — [(1 — cos rnr) + ( 1 — cos mr)\ 
mr 

giving b n = 0 for n even and b n = Ah/mt for n odd. Thus, the Fourier series representation 
of the square wave is given by 



fix) 



4h ( sin3x sin5x 

— sinxH 1 — 

7T V 3 5 



~ + '"J 



Fourier Series for any Interval 

Although we have discussed the Fourier representation in terms of a periodic function its 
application is much more fundamental, for any section or interval of a well behaved 
function may be chosen and expressed in terms of a Fourier series. This series will 
accurately represent the function only within the chosen interval. If applied outside that 
interval it will not follow the function but will periodically repeat the value of the function 
within the chosen interval. If we represent this interval by a Fourier cosine series the 
repetition will be that of an even function, if the representation is a Fourier sine series an 
odd function repetition will follow. 
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Figure 10.4 A Fourier series may represent a function over a selected half-interval. The general 
function in (a) is represented in the half-interval 0 < x < 1/2 by/ e , an even function cosine series in 
(b), and by an odd function sine series in (c). These representations are valid only in the specified 
half-interval. Their behaviour outside that half-interval is purely repetitive and departs from the 
original function 



Suppose now that we are interested in the behaviour of a function over only one-half of 
its full interval and have no interest in its representation outside this restricted region. In 
Figure 10.4a the function /(v) is shown over its full space interval —1/2 to +Z/2, but fix) 
can be represented completely in the interval 0 to +Z/2 by either a cosine function (which 
will repeat itself each half-interval as an even function) or it can be represented completely 
by a sine function, in which case it will repeat itself each half-interval as an odd function. 
Neither representation will match f{x) outside the region 0 to +Z/2, but in the half-interval 
0 to +1/2 we can write 



f(x) =f e (x) =f 0 (x) 



where the subscripts e and o are the even (cosine) or odd (sine) Fourier representations, 
respectively. 

The arguments of sines and cosines must, of course, be phase angles, and so far the 
variables x has been measured in radians. Now, however, the interval is specified as a 
distance and the variable becomes 2t xx/l, so that each time x changes by l the phase angle 
changes by 2tr. 

Thus 



/eW 






n= 1 



27 vnx 
a n cos — - — 




Fourier Series 



where 



2 interval 



l / 2 2 7172 r 

/(x) cos — - — d.r 

-;/ 2 ‘ 



2nnx 



f e (x) cos — — dx + 

_ -z/2 l 

1 / 2 2 TUI X 

f(x) cos — — dx 
n / 



27 rnx 

Je(x) COS — - — dx 



because 



/(x) =/ e (x) from x = 0 to Z/2 



and 



/(x) =/(— x) — / e (x) from x = 0 to — Z/2 
Similarly we can represent /(x) by the sine series 

Imix 



fix) =fo(x) = b n sin 

n= 1 

in the range x = 0 to 1/2 with 



bn " i . 

* t 



j interval 



1/2 f! ^ ■ 2lTnX A 

j (x) sin — - — dx 

-i/ 2 



1 



2imx 



-// 2 



/o(x) sin — dx + 



Z/2 



2irnx 



f 0 (x) sin - dx 
o ' 



In the second integral / 0 (x) =/(x) in the interval 0 to 1/2 whilst 



2 7172 X 



-Z/2 



/ 0 (x) sin — - — dx = 



Hence 



Z/2 

■Z/2 

0 



2tt;ix 

jo (— x) sin — - — dx = — 



27TZ1X 

jo\X) sin — - — dx = 



Z/2 



/ 0 (x) sin 



27TZZX 

/ 



V 2 2tthx 

f(x) sin — — dx 
o ‘ 



b " = 7 



V 2 2ttjix 

f(x) sin — — dx 

0 l 
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If we follow the behaviour of / e (x) and / 0 (x) outside the half-interval 0 to 1/2 (Fig- 
ure 10.4a, b) we see that they no longer represent /'(x). 



Application of Fourier Sine Series to a Triangular Function 

Figure 10.5 shows a function which we are going to describe by a sine series in the half- 
interval 0 to 7T. The function is 



fix) =x (o<x<^) 

and 

fix) = tt-x (| <x<7r ) 



Writing /(x) = Y) b„ sin nx gives 



2 r /2 

b n — — xsinnxdx 

X Jo 



nn 



sin 



2 f 77 

— ( 7 r — x) sin nx dx 

X , 7r/2 



When n is even sinwr/2 = 0, so that only terms with odd values of n are present and 



, 4 /sinx sin3x sin5x sin7x 

fix) = - 



TT \ 1 2 3 2 

Note that at x = tt/2, /(x) = 7r/2, giving 



7T“ 



1 1 1 1 

I 2 + 3 2 + 5 2 + “^ Ifffl 



n= 0 ( 2 « + 1 )‘ 



We shall use this result a little later. 



f(x) = x(0<x<-|) 




Figure 10.5 Function representing a plucked string and defined over a limited interval. When the 
string vibrates all the permitted harmonics contribute to the initial configuration 
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Note that the solid line in the interval 0 to — 7r in Figure 10.5 is the Fourier sine 
representation for f(x) repeated outside the interval 0 to 7r whilst the dotted line would 
result if we had represented f(x) in the interval 0 to 7r by an even cosine series. 



(Problems 10.1, 10.2, 10.3, 10.4, 10.5, 10.6, 10.7, 10.8, 10. 



9 ) 



Application to the Energy in the Normal Modes of a Vibrating 
String 



If we take a string of length 1 with fixed ends and pluck its centre a distance d we have the 
configuration of the half interval 0 to 7r of Figure 10.5 which we represented as a Fourier 
sine series. Releasing the string will set up its normal mode or standing wave vibrations, 
each of which we have shown on p. 126 to have the displacement 

y n = {A„ cos LO,,t + B n sinuv) sin — — (5.10) 

c 

where u>„ = iittc/ I is the normal mode frequency. 

The total displacement, which represents the shape of the plucked string at t = 0 is given 
by summing the normal modes 



U)„X 



y = 



E , v u/ n-^ 

(A n cos ic„t + B n sin oj„t) sin 



Note that this sum resembles a Fourier series where the fixed ends of the string, y = 0 at 
x — 0 and x = l allow only the sine terms in x in the series expansion. If the string remains 
plucked at rest only the terms in x with appropriate coefficients are required to describe it, 
but its vibrational motion after release has a time dependence which is expressed in each 
harmonic coefficient as 

A n cos u n t + B n since,,? 



The significance of these coefficients emerges when we consider the initial or boundary 
conditions in time. 

Let us write the total displacement of the string at time t = 0 as 



y 0 (x) = ^ y n {x) = ^ {A n cos ce„? + B„ since,, t) sin — 



LO n X 

c 






C J„X 



sin 



at t = 0 



Similarly we write the velocity of the string at time t = 0 as 
d 



y o{x ) - g~ t y o(x) = y„{x ) 

= ^ (— c o n A n since„f + t o n B n cos uj n t) sin 

E . i J n X 

uj n B n sin at t = 0 



QJ„x 

c 



Both yo{x) and vo(x') are thus expressed as Fourier sine series, but if the string is at rest at 
t — 0, then vq{x) — 0 and all the B n coefficients are zero, leaving only the A„’ s. If the 
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displacement of the string yo(x) = 0 at time t = 0 whilst the string is moving, then all the 
A n ’ s are zero and the Fourier coefficients are the u> n B„’s. 

We can solve for both A„ and u>„B n in the usual way for if 



and 



yo(x) = Y] A„ sin — 
c 



vo{x ) = 



uj„B n sin 



u> n x 

c 



for a string of length / then 



and 



A = 2 
" l 



, \ . U) n X 

yoU') sin dx 

o c 



d ^ t \ ' ^ nX , 

ui n B n = — vq{x) sin dx 

l Jo c 



If the plucked string of mass m (linear density p) is released from rest at 
t = 0 (vt)(x) — 0) the energy in each of its normal modes of vibration, given on p. 134 as 

E n = \mut 2 n (Al+B 2 n ) 

is simply 

E„ = \mw 2 „A 2 n 

because all 6,,’s are zero. 

The total vibrational energy of the released string will be the sum ^ E„ over all the 
modes present in the vibration. 

Let us now solve the problem of the plucked string released from rest. The configuration 
of Figure 10.5 (string length I, centre plucked a distance d) is given by 

2 dx l 

yo(x) = — o<x<- 

2 d(l — x) l 
= 2 ^' 

SO 



A 



n 



2 

7 



‘^ 2 2 dx . uj n x , 

— — sin dx 

, o l c 



' l 2 d(I - x) 
. // 2 / 



. 0J„X 

sin dx 

c 



8 d . rnr / 

-y—y sm ~xr for 
n-7r- 2 V 



nnc\ 

~T) 



We see at once that A„ = 0 for n even (when the sine term is zero) so that all even 
harmonic modes are missing. The physical explanation for this is that the even harmonics 
would require a node at the centre of the string which is always moving after release. 
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The displacement of our plucked string is therefore given by the addition of all the 
permitted (odd) modes as 



yo(x) = Y y nipt) = Y A n sin 



Hodd 



n odd 



where 



8 d . nir 
A n = — sin — 
ft z 7T z 2 

The energy of the nth mode of oscillation is 

p _ * . 2,2 _ Md 2 muj 2 n 

4 " " 4(w 2 7t 2 ) 2 

and the total vibrational energy of the string is given by 

Y^ 16 d 2 m ^ w 2 16 d 2 c 2 m y^ 1 

h ~ 2-^1 tn ~ 7t-4 2^ TT _ 77-2/2 ff 

n odd n odd n odd 



for 



nnc 



^ n 



But we saw in the last section that 



1 7T 2 



« odd 



SO 



e = YEu = 



2mc 2 d 2 2 Td 2 



l 2 



1 



where T = pc 2 is the constant tension in the string. 

This vibrational energy, in the absence of dissipation, must be equal to the potential 
energy of the plucked string before release and the reader should prove this by calculating 
the work done in plucking the centre of the string a small distance d, where d /. 

To summarize, our plucked string can be represented as a sine series of Fourier 
components, each giving an allowed normal mode of vibration when it is released. The 
concept of normal modes allows the energies of each mode to be added to give the total 
energy of vibration which must equal the potential energy of the plucked string before 
release. The energy of the nth mode is proportional to n~ 2 and therefore decreases with 
increasing frequency. Even modes are forbidden by the initial boundary conditions. 
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The boundary conditions determine which modes are allowed. If the string were struck 
by a hammer those harmonics having a node at the point of impact would be absent, as in 
the case of the plucked string. Pianos are commonly designed with the hammer striking a 
point one seventh of the way along the string, thus eliminating the seventh harmonic which 
combines to produce discordant effects. 

Fourier Series Analysis of a Rectangular Velocity Pulse on a 
String 

Let us now consider a problem similar to that of the last section except that now the 
displacement yo(x) of the string is zero at time r = 0 whilst the velocity vo(x) is non-zero. 
A string of length l, fixed at both ends, is struck by a mallet of width a about its centre 
point. At the moment of impact the displacement 

yo(*) = 0 



but the velocity 



Vo(x) 



dy o(*) _ Q 
dt 



for 



= v for 




a 

> - 
- 2 



a 




This situation is shown in Figure 10.6. 
The Fourier series is given by 



vq(x) 



E \ ' _ . 0J n X 

y n — / J ^ n,B n sin — 



where 



UJ rjd n 



2 

7 



rl/2+a/l 

J +1/2— a/2 



. U n X 

rsin 

c 



dx 



4v mr 
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Figure 10.6 Velocity distribution at time t = 0 of a string length /, fixed at both ends and struck 
about its centre point by a mallet of width a. Displacement y 0 (x) = 0; velocity vq(x) = v for 
|x — // 2\ < a/2 and zero outside this region 
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Again we see that u> n B n = 0 for n even (sin mt/2 = 0) because the centre point of the 
string is never stationary, as is required in an even harmonic. 

Thus 



»oW = 

n odd 



4v . mra . uj n x 

— sin — — sin 

mr 21 c 



The energy per mode of oscillation 






rBl) 

--\mw 2 n B 2 n (All A n ’s — 0) 

. I6v 2 . 2 nita 
~~ 4 m V2Z2 sin WT 



Now 



^ n 

n = — 

UJl 



UJ 

7 TC 



for the fundamental frequency 



So 



7 TC 

"i = T 



F 
^ n 



Amv 2 c 2 

l 2 u>l 



sin 2 



LO n a 

^7 



Again we see, since uj„ oc n that the energy of the nth mode oc n 2 and decreases with 
increasing harmonic frequency. We may show this by rewriting 

mv 2 a 2 sin 2 (cu„a/2c) 

E 'A UJ ) = 72 7 /0 x2 

< (ui n a/2c) 

mv 2 a 2 sin 2 a 

I 2 a 2 



where 



a = uj n a/2c 

and plotting this expression as an energy-frequency spectrum in Figure 10.7. 
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Figure 10.7 (a) Distribution of the energy in the harmonics uj„ of the string of Figure 10.6. The 

spectrum E n (u) oc sin 2 a/a 2 where a = uj„a/2c. Most of the energy in the string is contained in the 
frequency range A u> « 27 r c/a, and for a = Ax (the spatial width of the pulse), A x/c = At and 
AujAtx 2 -k (Bandwidth Theorem). Note that the values of E n (co) for w 3 , u> 5 , w 7 , etc. are magnified 
for clarity, (b) The true shape of the pulse 



The familiar curve of sin 2 a/a 2 again appears as the envelope of the energy values for 
each uj n . 

If the energy at cui is E\ then E 3 = E i/9 and E 5 =£1/25 so the major portion of the 
energy in the velocity pulse is to be found in the low frequencies. The first zero of the 
envelope sin 2 a/o 2 occurs when 

uja 

a = — = 7r 
2c 

so the width of the central frequency pulse containing most of the energy is given by 

27TC 

U) « 

a 



This range of energy-bearing harmonics is known as the ‘spectral width’ of the pulse 
written 




The ‘spatial width’ a of the pulse may be written as A* so we have 



Ax A LO « 2-7TC 
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Reducing the width Ax of the mallet will increase the range of frequencies Ac o required 
to take up the energy in the rectangular velocity pulse. Now c is the velocity of waves on 
the string so a wave travels a distance Ax along the string in a time 

At = A x/c 

which defines the duration of the pulse giving 

AcoAt « 27t 



or 



AvAt k 1 

the Bandwidth Theorem we first met on p. 134. 

Note that the harmonics have frequencies 

imc 

Un= — 

so 7 tc/I is the harmonic interval. When the length / of the string becomes very long and 
/ — > oo so that the pulse is isolated and non-periodic, the harmonic interval becomes so 
small that it becomes differential and the Fourier series summation becomes the Fourier 
Integral discussed on p. 283. 

The Spectrum of a Fourier Series 

The Fourier series can always be represented as a frequency spectrum. In Figure 10.8 a the 
relative amplitudes of the frequency components of the square wave of Figure 1 0. 1 are 
plotted, each sine term giving a single spectral line. In a similar manner, the distribution of 
energy with frequency may be displayed for the plucked string of the earlier section. The 
frequency of the r th mode of vibration is given by uj r = me /l, and the energy in each 
mode varies inversely with r 2 , where r is odd. The spectrum of energy distribution is 
therefore given by Figure 10.8 b. 

Suppose now that the length of this string is halved but that the total energy remains 
constant. The frequency of the fundamental is now increased to uj' r = 2mc/l and 
the frequency interval between consecutive spectral lines is doubled (Figure 10.8 c). Again, 
the smaller the region in which a given amount of energy is concentrated the wider the 
frequency spectrum required to represent it. 

Frequently, as in the next section, a Fourier series is expressed in its complex or 
exponential form 



OO 

f(t)= £ d„ e inuJt 

n = — oo 
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x 3n 5x lx 

(a) 




(d) (e) 



Figure 10.8 (a) Fourier sine series of a square wave represented as a frequency spectrum; (b) 

energy spectrum of a plucked string of length t; and (c) the energy spectrum of a plucked string of 
length 1/2 with the same totaL energy as (b), demonstrating the Bandwidth Theorem that the greater 
the concentration of the energy in space or time the wider its frequency spectrum. Complex 
exponential frequency spectrum of (d) cos tut and (e) sin tut 
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where 2 d n = a„ — ib n (n > 0) and 2 d n = a - n + ib_„(n < 0). 
Because 



cos nut = k(e inuJt + e~ inu,t ) 

and 

sin no;/ = (e i,!wr — e _i ” w ') 

2i v 2 

a frequency spectrum in the complex plane produces two spectral lines for each frequency 
component nw, one at +nw and the other at —nw. Figure 10.8 d shows the cosine 
representation, which lies wholly in the real plane, and Figure 10.8 e shows the sine 
representation, which is wholly imaginary. The amplitudes of the lines in the positive and 
negative frequency ranges are, of course, complex conjugates, and the modulus of their 
product gives the square of the true amplitude. The concept of a negative frequency is seen 
to arise because the e ~ inwt term increases its phase in the opposite sense to that of the 
positive term e inwt . The negative amplitude of the negative frequency in the sine repre- 
sentation indicates that it is in antiphase with respect to that of the positive term. 

Fourier Integral 

At the beginning of this chapter we saw that one Fourier representation of the function 
could be written 

OO 

/(*)= E d ^ wx 

n =— co 

where 2 d n = a n — i b„(n > 0) and 2 d„ = a_„ + i b-„(n < 0). 

If we use the time as a variable we may rewrite this as 

CO 

m= Y, d ^' nut 

n=—oo 



where, if T is the period. 



d n 



1 f r / 2 

- f(t) e~ inut dt 

1 J-T/2 



(for n = —2, —1,0, 1,2, etc.). 

If we write uj = Inu \ , where v \ is the fundamental frequency, we can write 



OO 

m = E 

n =— co 



oT / 2 

/(f')e“ i27rm/lf 'dt' 

J -T / 2 



g ilimu 1 1 



l 

T 
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If we now let the period T approach infinity we are isolating a single pulse by saying that 
it will not be repeated for an infinite period; the frequency o\ = l/T — > 0, and 1 /T 
becomes infinitesimal and may be written do. 

Furthermore, n times v\, when n becomes as large as we please and \/T = o\ — > 0, may 
be written as nv\ = o, and the sum over n now becomes an integral, since unit change in n 
produces an infinitesimal change in n/T = no j. 

Hence, for an inhnite period, that is for a single non-periodic pulse, we may write 



fit) 



fit ') e 



—il'KVt' 



At' 



f 2m, do 



which is called the Fourier Integral. 
We may express this as 



fit) 



•OO 

F{o) e' 2%vt do 

— OO 



where 



F(o) 



*oo 

/(/) dt' 

— OO 



is called the Fourier Transform of /(f). We shall discuss the transform in more detail in a 
later section of this chapter. 

We see that when the period is finite and /(f) is periodic, the expression 



OO 

fit) = £ dn^ 

n = — oo 



tells us that the representation is in terms of an infinite number of different frequencies, 
each frequency separated by a finite amount from its nearest neighbour, but when /(f) is not 
periodic and has an infinite period then 



fit) 



•OO 

F(o) &' 2mt do 

— OO 



and this expression is the integral (not the sum) of an infinite number of frequency 
components of amplitude F( v) d o infinitely close together, since o varies continuously 
instead of in discrete steps. 

For a periodic function the amplitude of the Fourier series coefficient 



n 



l 

T 



T/2 

J -T/2 



d, 



/(f) e~ inuJt dt 
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whereas the corresponding amplitude in the Fourier integral is 



F{v) Av = 




•OO 

/(*') e - 5 **' dr' 

J — OO 



This corroborates the statement we made when discussing the frequency spectrum that 
the narrower or less extended the pulse the wider the range of frequency components 
required to represent it. A truly monochromatic wave of one frequency and wavelength (or 
wave number) requires a wave train of infinite length before it is properly defined. 

No wave train of finite length can be defined in terms of one unique wavelength. 

Since a monochromatic wave, infinitely long, of single frequency and constant amplitude 
transmits no information, its amplitude must be modified by adding other frequencies (as 
we have seen in Chapter 5) before the variation in amplitude can convey information. 
These ideas are expressed in terms of the Bandwidth Theorem. 



Fourier Transforms 

We have just seen that the Fourier integral representing a non-periodic wave group can be 
written 



f(t) 



•OO 

F{y) e i2wpl An 

— OO 



where its Fourier transform 






*oo 

/(f')e _iW At' 

— OO 



so that integration with respect to one variable produces a function of the other. Both 
variables appear as a product in the index of an exponential, and this product must be non- 
dimensional. Any pair of variables which satisfy this criterion forms a Fourier pair of 
transforms, since from the symmetry of the expressions we see immediately that if 

F(v) is the Fourier transform of /(f) 

then 

f{—v) is the Fourier transform of Fit) 



If we are given the distribution in time of a function we can immediately express it as a 
spectrum of frequency, and vice versa. In the same way, a given distribution in space can be 
expressed as a function of wave numbers (this merely involves a factor, 1/27T, in front of 
the transform because k = 2n/X). 

A similar factor appears if c o is used instead of v. If the function of /(f) is even only the 
cosine of the exponential is operative, and we have a Fourier cosine transform 



/(0 



•OO 

F(y) cos lirvtAv 
. o 
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and 



Fiy) 



*oo 

/(f) cos Itnstdt 
. o 



If /(f) is odd only the sine terms operate, and sine terms replace the cosines above. Note 
that only positive frequencies appear. The Fourier transform of an even function is real and 
even, whilst that of an odd function is imaginary and odd. 



Examples of Fourier Transforms 

The two examples of Fourier transforms chosen to illustrate the method are of great 
physical significance. They are 

1. The ‘slit’ function of Figure 10.9a, 

2. The Gaussian function of Figure 10.11. 

As shown, they are both even functions and their transforms are therefore real; the physical 
significance of this is that all the frequency components have the same phase at zero time. 



The Slit Function 

This is a function having height h over the time range ±d/2. Thus, /(f) = h for |f| < d/2 
and zero for |f| > d/2, so that 






fit) 



„ —H'KVt 



d t = 



rd/2 

he~ i2mt dt 

, -d/2 



\ —\2'Kvd/2 

i2ni/ [ 



- e +i2 ™*/ 2 ] = hd^ 
a 



(a) 



t 

h 

t= 0 
- d-> 



time 



(b) 




Figure 10.9 (a) Narrow slit function of extent d in time and of height h, and (b) its Fourier 

transform 
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where 

lin'd 



Again we see the Fourier transformation of a rectangular pulse in time to a sin a/ a pattern 
in frequency. The Fourier transform of the same pulse in space will give the same 
distribution as a function of wavelength. Figure 10.9b shows that as the pulse width 
decreases in time the separation between the zeros of the transform is increased. The 
negative values in the spectrum of the transform indicate a phase reversal for the amplitude 
of the corresponding frequency component. 

The Fourier Transform Applied to Optical Diffraction from a 
Single Slit 

This topic belongs more properly to the next chapter where it will be treated by another 
method, but here we derive the fundamental result as an example of the Fourier Transform. 
The elegance of this method is seen in problems more complicated than the one- 
dimensional example considered here. We shall see its extension to two dimensions in 
Chapter 12 when we consider the diffraction patterns produced by rectangular and circular 
apertures. 

The amplitude of light passing through a single slit may be represented in space by the 
rectangular pulse of Figure 10.9a where d is now the width of the slit. A plane wave of 
monochromatic light, wavelength A, falling normally on a screen which contains the 
narrow slit of width d ~ A, forms a secondary system of plane waves diffracted in all 
directions with respect to the screen. When these diffracted waves are focused on to a 
second screen the intensity distribution (square of the amplitude) may be determined in 
terms of the aperture dimension d, the wavelength A and the angle of diffraction 6. 

In Figure 10.10 the light diffracted through an angle 9 is brought to focus at a point P on 
the screen PPo. Finding the amplitude of the light at P is the simple problem of adding all 
the small contributions in the diffracted wavefront taking account of all the phase 
differences which arise with variation of path length from P to the points in the slit aperture 
from which the contributions originate. The diffraction amplitude in k or wave number 
space is the Fourier transform of the pulse, width d, in x space in Figure 10.9b. The 
conjugate parameters v and t are exactly reciprocal but the product of x and k involves the term 
27 t which requires either a constant factor 1 /27r in front of one of the transform integrals or 
a common factor 1 / s/liv in front of each. This factor is however absorbed into the constant 
value of the maximum intensity and all other intensities are measured relative to it. 

The constant pulse height now measures the amplitude h of the small wave sources 
across the slit width d and the Fourier transform method is the addition by integration of 
their contributions. 

In Figure 10.10 we see that the path difference between the contribution at the centre of 
the slit and that at a point x in the slit is given by x sin 6 , so that the phase difference is 
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Source of 

monochromatic 

light 




Figure 10.10 A monochromatic plane wave normally incident on a narrow slit of width d is 
diffracted an angle 9, and the light in this direction is focused at a point P. The amplitude at P is the 
superposition of all contributions with their appropriate phases with respect to the centraL point in 
the slit. The contribution from a point x in the slit has phase cj> = 27rxsin 9/ X with respect to the 
central contribution. The phase difference from contributing points on opposite edges of the slit is 
<j> = 2nd sin 9/ A = 2a 



The product kx sin 9 can, however be expressed in a form more suitable for extension to 
two- and three-dimensional examples by writing it as k • x = klx, the scalar product of the 
vector k, giving the wave propagation direction, and the vector x, l being the direction 
cosine 



l = cos (tt/2 — 9) 
— sin# 



of k with respect to the x-axis. 

Adding all the small contributions across the slit to obtain the amplitude at P by the 
Fourier transform method gives 



m = 



i 

27T 

1 

27T 

h 



f{x) e ^ dx 
+d/2 



he 



— i klx 



dx 



-d/2 

1 



— i kl 2i r 
—2\h 



( e 



-ikld/2 +\kh 



kid 



d/2j 



—iklln Sm 2 
dh sin a 



2n a 
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where 



kid 7T 

a = — — = -flsinfl 
2 A 



The intensity I at P is given by the square of the amplitude; that is, by the product of F{k) 
and its complex conjugate F*{k), so that 

d 2 h 2 sin 2 a 
47 t 2 a 2 



where 7o, the principal maximum intensity at a — 0, (Po in Figure 10.10) is now 

d 2 h 2 

' 0= 4 ? 



The Gaussian Curve 

This curve often appears as the wave group description of a particle in wave mechanics. 
The Fourier transform of a Guassian distribution is another Gaussian distribution. 

In Figure 10.11a the Gaussian function of height h is symmetrically centred at 
time t = 0, and is given by /'(f) = h e r G , where the width parameter or standard 
deviation a is that value of t at which the height of the curve has a value equal to e 1 of its 
maximum. 

Its transform is 




Figure 10.11 (a) A Gaussian function Fourier transforms (b) into another Gaussian function 
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The integral 




dx = y/n 



and substituting, with x — ( t/a + xitvci) and d t = crdx, gives 



F{v) = hot: 1 12 



another Gaussian distribution in frequency space (Figure 10.11b) with a new height hot ; 1 /2 
and a new width parameter (ott) -1 . 

As in the case of the slit and the diffraction pattern, we see again that a narrow pulse in 
time (width a) leads to a wide frequency distribution [width (ott) 1 ]. 

When the curve is normalized so that the area under it is unity, h takes the value (ott) ^ 2 
because 



(ott 1 / 2 ) 



-t 2 !<j 2 At - 



dr = 1 



Thus, the height of a normalized curve transforms into a pulse of unit height whereas a 
pulse of unit height transforms to a pulse of width ( air) 1 . 

If we consider a family of functions with progressively increasing h values and decreasing n 
values, each satisfying the condition of unit area under their curves, we are led in the limit as the 
height h — > oo and the width o — » 0 to an infinitely narrow pulse of finite area unity which 
defines the Dirac delta ( 8 ) function. The transform of such a function is the constant unity, and 
Figures. 10.12a and b show the family of normalized Gaussian distributions and their transforms. 
Figure 10.13 shows a number of common Fourier transform pairs. 





8 function (1) 



Figure 10.12 (a) A family of normalized Gaussian functions narrowed in the limit to Dirac's delta 

function; (b) the family of their Fourier transforms 
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Figure 10.13 Some common Fourier transform pairs 



In wave mechanics the position x of a particle and its momentum p x are conjugate 
parameters and its Gaussian wave group representation may be Fourier transformed from x 
to p x space and vice versa. The Fourier Transform gives the amplitude of the wave function 
but the probability of finding the particle at x or its having a given momentum p x is 
proportional to the square of the amplitude. 
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The Dirac Delta Function, its Sifting Property and its Fourier 
Transform 

The Dirac 6 function is defined by 



8(x) = 0 at x 7 ^ 0 
= oo at x = 0 



and 

•OO 

<5(x)dx = 1 

J— OO 



i.e., an infinitely narrow pulse centred on x = 0. It is also known as the unit impulse 
function. 

A valuable characteristic is its sifting property, that is 



•OO 

8(x- x 0 )/(x)dx =f(x o) 

— OO 

The Fourier Transform of 6(x — xq) — e lkx ° because by definition 



F(8(x-x q)) 



•OO 

8{x — xo)e~ tla dx 

— OO 



so writing /(x) = e tkx and applying the sifting property gives /(xq) = e ,kx ° . Note that 
e -ifao — e ‘kx 0 — j f or Xq — o. 

From the form of the transform we see that if a function /(x) is a sum of individual 
functions then the Fourier Transform F(f(x)) is the sum of their individual transforms. 
Thus, if 

/(*) = Y1 6 ( x ~ ■*/) 

j 

then 



Ff(x) = Y,e~ ikXi 

j 

Figure 10.14 shows two Dirac 8 functions situated at x = ±| so that /(x) = 
8(x — f) + 8(x + |) giving F(/(x)) — e' k % + e lk i = 2coska/2. 



Convolution 

Given two functions /(x) and h (x ) , their convolution, written 



/(x) <8> h{x) 



• OO 

f(x)h(x) dx 



— OO 
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f 



1 1 1 ! ► X 

— 3/ 0 + 3/ 

/ 2 / 2 



Figure 10.14 The Fourier transform of two Dirac <5 functions located at x = ±a/2 is 2 cos ka/2 



is the overlap area under the product of the two functions as one function scans across the 
other. It the functions are two dimensional, /(x, y) and h(x. y), their convolution is the 
volume overlap under their product. 

To illustrate a one-dimensional convolution consider the rectangular pulse of length D in 
Figure 10.15 convolved with an identical pulse. This is known as self-convolution. The 
convolution will be the sum of the shaded areas such as that of Figure 10.15a as one pulse 
slides over the other. We can see that the base length of the resulting convolved pulse will 
be 2D and that it will be symmetric about its peak, that is, when the two pulses completely 
overlap. If we consider the left-hand pulse as an infinite series of 6 functions, of which we 
show a few, then Figure 10.15b shows that the integrated sum is an isosceles triangle of 
base length 2D. 

Another example is the convolution of a small triangular pulse with a rectangular pulse 
length D , Figure 10.16. Again, we use the series of 5 functions to show the sum of the 
components of the resulting convolution and its integrated form for an infinite series of 5 
functions. The length of the final pulse is again the sum of the lengths of the two pulses. 

Such a pulse would result in the convolution of a rectangular pulse with an exponential 
time function, for example, when a rectangular pulse is passed into an integrating network 
formed by a series resistance and parallel condenser. Figure 10.17. Here, the exponential 
time function of the network may be considered as fixed in time while the pulse performs 
the scanning operation. Note in Figures 10.15, 10.16 and 10.17 that the component 
contributions of the left hand pulses are summed in reverse order. This is explained in the 
discussion following eq. 10.2. 

A convolution /(x) 0 h (x) is generally written in the form 



S(x) 



•OO 

f(x)h(x' — x)dx 

J — OO 



( 10 . 2 ) 



This a particularly relevant form when we consider the Optical Transfer Function on page 
391. There, x is an object space coordinate and x' is an image space coordinate so the 
convolution relates image to object. If the function h(x' — x) is a localized pulse in 
the object space and x J lies within it on the object axis x then the pulse h(x' — x) is reversed 
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Convolution 




Figure 10.15 (a) A convolution is the integral of all overlapping areas as one function scans 

another. A rectangular pulse length D scans an identical pulse and the overlap area is shaded at one 
point of the scanning, (b) The scanning pulse is represented by several Dirac 5 (impulse) functions 
and the component overlap areas are summed. When the number of impulse functions is large the 
sum of the components is integrated to become the triangular pulse 



in image space (axis x 1 ) so that its trailing edge becomes its leading edge. Figure 10.18(a) 
shows the pulse on the object axis and Figure 10.18(b) shows the reversed pulse on the 
image axis. 

The product f(x) h(x' — x) exists only where the functions overlap and in Fig- 
ure 10.18(b) g(x' } ) is the superposition of all the individual overlapping contributions that 
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Figure 10.16 The convolution of a triangular with a rectangular pulse using the method of Figure 
10.15 

• WW — | — 

_ * . Tl 

t = 0 t = t-! t = 0 1 = 1! 

Figure 10.17 The convolution of Figure 10.16 is the same as that of a rectangular electrical puLse 
passing through an integrating circuit formed by a series resistance and a parallel condenser 




exist at y! v The contribution to g(x\ ) at x\ by x\ and dr at X\ is f(x\ )h(-\\ — x \ ) dr where 
f(x i) is a number which magnifies the pulse of Figure 10.18(b) to become the pulse of 
Figure 10.18(c). Each value of x in the overlap region makes a contribution to g(x' l ); x 
values beyond the overlap make no contribution. The contributions begin when the leading 
edge of h(xf — x) reaches x' l and they cease when its trailing edge passes x! v 
Note that by changing the variable x!' = x! — x in Equation (10.2). 

f ®h — h <g )/ 

This result is also evident when we consider the Convolution Theorem in the next section. 
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(a) 




(b) 




(c) 

Figure 10.18 The function h(x' — x) in the object space is reversed in the image space in Figure 
10.18(b). (b) The convolution g(x[) is the superposition of all individual overlapping contributions to 
f(x)h(x ' — x) that exist atxj. (c) The contribution made by/(xi)dx to g(x[) wher e/(xi) is a number 
which magnifies h(x , 1 — x) 
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Returning to the convolution of the rectangular pulses in Figure 10.15 and taking the 
left-hand pulse as f(x) each impulse x, of the infinite series sweeps across the right-hand 
pulse h{x! — x) to give the triangular convolution g(x'). If the left-hand pulse is now 
h(x' — x) sweeping across the right-hand pulse fix) with x' as a fixed location in h (x 1 — x), 
the series of overlaps, as x- moves across /(x), gives the same triangular convolution. 

The Convolution Theorem 

The importance of the convolution process may be seen by considering the following. 

When a signal, electrical or optical, passes through a system such as an amplifier or a 
lens, the resulting output is a function of the original signal and the system response. We 
have seen that a slit, in passing light from an optical source, may act as an angular filter, 
restricting the amount of information it passes and superimposing its own transform on the 
radiation passing through. An electrical filter can behave in a similar fashion. 

Effectively there are two transformations, one into the intermediate system and one out 
again. 

A convolution reduces this to a single transformation. The transform of the intermediate 
system is applied to the orginal function or signal and the resulting output is the integrated 
product of each point operating on the transformed response. 

The convolution theorem states that the Fourier transform of the convolution of two 
functions is the product of the Fourier transforms of the individual functions, that is, if 

g(x') =f(x)®h(x) 

then 



F(g)=F(f®h)=F(f)-F(h) 

The proof is straightforward. 

The convolution g(x') is a function of k, so its transform is 
F(g) = G(k ) = [°° g(x')e-‘ 7 “V 

O 

f(x)h(x — x)dx 

-oo 
o 

h(x — x)e~ ,kxl dx' 

-oo 

Putting x' — x = y gives dy = dx' and e ,kxJ = Q -iky Q -ikx an( j so 



e-'^dx' 



/(•*) dx 



F(g) = G(k) = 



/(x)e ,kx dx 



h(y)e ,A:v dy 



= F(f) ■ F(h) = F(h) ■ F{f) 



We can use this result to find the Fourier Transform of the resulting triangular pulse in 
Figure 10.15(b). The slit may be seen as a rectangular pulse of width d and its Fourier 
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Transform on page 288 gave its diffraction pattern as cx sin a /a where a = kid/ 2. Each of 
the pulses in Figure 10.15(b) contributes a Fourier Transform cx sin /3/P where 



P = 



klD 

~Y 



so the Fourier Transform of the isosceles triangular pulse is cx sin 2 /3//3 2 . 

Note that the analysis above is equally true if the arguments of the two functions are 
exchanged under the convolution process so that we ha ve/(x' — x) and h (x ) . We use this in 
the discussion on the Optical Transfer Function on page 393. 



(Problems 10.10, 10.11, 10.12, 10.13, 10.14, 10.15, 10.16, 10.17, 10.18, 10.19) 



Problem 10.1 

After inspection of the two wave forms in the diagram what can you say about the values of the 
constant, absence or presence of sine terms, cosine terms, odd or even harmonics, and range of 
harmonics required in their Fourier series representation? (Do not use any mathematics.) 




t 




Problem 10.2 

Show that if a periodic waveform is such that each half-cycle is identical except in sign with the 
previous one, its Fourier spectrum contains no even order frequency components. Examine the result 
physically. 

Problem 10.3 

A half-wave rectifier removes the negative half-cycles of a pure sinusoidal wave y = h sinx. Show 
that the Fourier series is given by 

h 7T . 2 „ 2 , 2 ^ \ 

y = — 1 -I sinx cos2x cos4x cosox. . . 

7r V 1-2 1-3 3-5 5-7 ) 
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Problem 10.4 

A full-wave rectifier merely inverts the negative half-cycle in Problem 10.3. Show that this doubles 
the output and removes the undesirable modulating ripple of the first harmonic. 

Problem 10.5 

Show that f{x) = x 1 may be represented in the interval ± ty by 

f(x) = \ n 2 + V(-l)"A C os«* 
j z J n L 



Problem 10.6 

Use the square wave sine series of unit height f(x) = 4/7r(sin.r + i sinSv + J; sin5v) to show that 

1 - i + i - i = tt/4 



Problem 10.7 

An infinite train of pulses of unit height, with pulse duration 2 r and a period between pulses of T, is 
expressed as 



f(t) =0 for — * T < t < — t 
= 1 for — t < t < T 
= 0 for r < r < 1 7 



and 



f(t+T)=f(t) 

Show that this is an even function with the cosine coefficients given by 

2 . 2tt 

a I, = — sin — nr 
rnr T 



Problem 10.8 

Show, in Problem 10.7, that as r becomes very small the values of a„ — > 4 t/T and are independent 
of n, so that the spectrum consists of an infinite set of lines of constant height and spacing. The 
representation now has the same form in both time and frequency; such a function is called ‘self 
reciprocal’. What is the physical significance of the fact that as r — > 0, a„ — > 0? 

Problem 10.9 

The pulses of Problems 10.7 and 10.8 now have amplitude 1 /2r with unit area under each pulse. 
Show that as r — > 0 the infinite series of pulses is given by 

f(t) = - + - cos 2-imt/T 

n = 1 

Under these conditions the amplitude of the original pulses becomes infinite, the energy per pulse 
remains finite and for an infinity of pulses in the train the total energy in the waveform is also 
infinite. The amplitude of the individual components in the frequency representation is finite, 
representing finite energy, but again, an infinity of components gives an infinite energy. 
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Problem 10.10 

The unit step function is defined by the relation 

fit) = 1 it > 0) 

= 0 (f < 0) 

* 

; f(t) 



i 

0 t"-^ 

This is a very important function in physics and engineering, but it does not satisfy the criteria for 
Fourier representation because its integral is not finite. A similar function of finite period will satisfy 
the criteria. If this function is defined 



fit) = 1(0 < t < T) 

= 0 elsewhere 



show that if the transform 



F(oj) 



“OO 

/(r)e- iw 'dr = 

— OO 

— [1 -e iwr l 

ICO 



T 



JO 



Q~ iujt dt 



then 






l l 

~2 + 2n 



F(uj)O u1 &lo 

OO | 

— e irf d UJ 

-no 1W 



(use the fact that for T very large 



— e i “( f - r) d lo = 



— e dcu = —7 r 

tw 



Note that the integral for the second term of /(f) gives — 7r for t < 0 and +7r for t > 0. This spectral 
representation is shown in Figure 10.13.) 



Problem 10.11 

Optical wave trains emitted by radiating atoms are of finite length and only an infinite wave train 
may be defined in terms of one frequency. The radiation from atoms therefore has a frequency 
bandwidth which contributes to the spectral linewidth. The random phase relationships between 
these wave trains create incoherence and produce the difficulties in obtaining interference effects 
from separate sources. 
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Let a finite length monochromatic wave train of wavelength Ao be represented by 



f(t) =/oe iW 



and be a cosine of constant amplitude /o extending in time between ±r/2. The distance l = cr is 
called the coherence length. This finite train is the superposition of frequency components of 
amplitude F(v) where the transform gives 



/« 



•OO 

F{v)C lm, dv 

J — OO 



so that 



F(v) 



“OO 

f{t')e~ i 2 mt ' dt' 

J — OO 
* +t/2 

f oe -i2n(v-v 0 ) t ' df , 

-t/2 



Show that 



F{v) =f 0 T 



sin[7r(^ — i/o )r] 
- vq )t 



and that the relative energy distribution in the spectrum follows the intensity distribution curve in a 
single slit diffraction pattern. 



Problem 10.12 

Show that the total width of the first maximum of the energy spectrum of Problem 10.11 has a 
frequency range 2/S.v which defines the coherence length / of Problem 10.11 as Ag/AA. 

Problem 10.13 

For a ruby beam the value of Au in Problem 10.12 is found to be 10 4 Hz and Ao = 6.936 x 10~ 7 m. 
Show that AA = 1.6 x 10~ 17 m and that the coherence length l of the beam is 3 x 10 4 m. 

Problem 10.14 

The energy of the finite wave train of the damped simple harmonic vibrations of the radiating atom 
in Chapter 2 was described by E = E oe"““'' e . Show from physical arguments that this defines a 
frequency bandwidth in this train of A u about the frequency u> o, where the quality factor 
Q = wo/ Aw. (Suggested line of argument — at the maximum amplitude all frequency components 
are in phase. After a time r the frequency component w o has changed phase by u>qt. Other 
components have a phase change which interfere destructively. What bandwidth and phase change is 
acceptable?) 

Problem 10.15 

Consider Problem 10.14 more formally. Let the damped wave be represented as a function of time by 

f(t) =/ 0 e i 2 ,n/ 0 , e- ( / r 

where /o is constant and r is the decay constant. 
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Fourier Methods 



Use the Fourier transform to show that the amplitudes in the frequency spectrum are given by 



m = T 



_Jo 

/t+ i 27 T (v— Vq) 



Write the denominator of F( v) as rc'° to show that the energy distribution of frequencies in the 
region of v — vq is given by 



I^MI 2 = 4 = - 



/o 2 



| 2 _f_l_ 

r 2 (l/r) 2 + (lo-loq) 2 



Problem 10.16 

Show that the expression |F(i/)| 2 of Problem 10.15 is the resonance power curve of Chapter 3; show 
that it has a width at half the maximum value (/or) 2 which gives Av = 1/ttt, and show that a 
spectral line which has a value of A A in Problem 10.12 equal to 3 x 10 ~ 9 m has a finite wave train of 
coherence length equal to 32 x 10 _6 m (32 pm) if Ao = 5.46 x 10~ 7 m. 

Problem 10.17 

Sketch the self-convolution of the double slit function shown in Figure Q 10.17. 




Figure Q.10.17 

Problem 10.18 

Sketch the convolution of the two functions in Figure Q 10.18 and use the convolution theorem to 
find its Fourier transform. 

























< 0 > 


< 0 





Figure Q.10.18 




The Convolution Theorem 
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Problem 10.19 

The convolution of two identical circles of radius r is very important in the modern method of testing 
lenses against an ideal diffraction limited criterion. 

In Figure Q 10.19 show that the area of overlap is 




A = r 2 (29 — 2sin 6 cos 6) 

and show for 



that the convolution 



R<2r 



O(R) 



r 2 2 cos 



R 

2r 




l 

1 R 
2 r 



Sketch 0(R) for O < R < 2 r 

Apart from a constant the linear operator O is known as the modulation factor of the 
optical transfer function. 



Summary of Important Results 

Fourier Series 

Any function may be represented in the interval ± it by 

\ n n 

fix) = -a o + E a n cos nx + b n sin nx 

2 l t 
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Fourier Methods 



where 

1 f 2 ^ 

a, t = — f(x) cos nx Ax 

K Jo 

and 

1 f 2?r 

b n = — f{x) sin nxdx 
it Jo 



Fourier Integral 

A single non-periodic pulse may be represented as 

*+oo r p+oo 

f(t)= f(t')e~ i2mt ' At' e i2mt A v 

— OO L — OO 



or as 



where 




f(t) and F(y) are Fourier Transforms of each other. When t is replaced by x and o by k the 
right hand side of each transform has a factor l/s/ltr. The Fourier Transform of a 
rectangular pulse has the shape of sin a/a. (Important in optical diffraction.) 





11 

Waves in Optical Systems 

Light. Waves or Rays? 

Light exhibits a dual nature. In practice, its passage through optical instruments such as 
telescopes and microscopes is most easily shown by geometrical ray diagrams but the fine 
detail of the images formed by these instruments is governed by diffraction which, together 
with interference, requires light to propagate as waves. This chapter will correlate the 
geometrical optics of these instruments with wavefront propagation. In Chapter 12 we shall 
consider the effects of interference and diffraction. 

The electromagnetic wave nature of light was convincingly settled by Clerk-Maxwell in 
1864 but as early as 1690 Huygens was trying to reconcile waves and rays. He proposed 
that light be represented as a wavefront, each point on this front acting as a source 
of secondary wavelets whose envelope became the new position of the wavefront. 
Figure 11.1(a). Light propagation was seen as the progressive development of such a 
process. In this way, reflection and refraction at a plane boundary separating two optical 
media could be explained as shown in Figure 11.1(b) and (c). 

Huygens’ theory was explicit only on those contributions to the new wavefront directly 
ahead of each point source of secondary waves. No statement was made about propagation 
in the backward direction nor about contributions in the oblique forward direction. Each of 
these difficulties is resolved in the more rigorous development of the theory by Kirchhoff 
which uses the fact that light waves are oscillatory (see Appendix 2, p. 547). 

The way in which rays may represent the propagation of wavefronts is shown in 
Figure 11.2 where spherically diverging, plane and spherically converging wavefronts are 
moving from left to right. All parts of the wavefront (a surface of constant phase) take the 
same time to travel from the source and all points on the wavefront are the same optical 
distance from the source. This optical distance must take account of the changes of 
refractive index met by the wavefront as it propagates. If the physical path length is 
measured as x in a medium of refractive index n then the optical path length in the medium 
is the product nx. In travelling from one point to another light chooses a unique optical path 
which may always be defined in terms of Fermat’s Principle. 



The Physics of Vibrations and Waves, 6th Edition H.J.Pain 
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Figure 11.2 Ray representation of spherically diverging, plane and spherically converging 
wavefronts 

Fermat's Principle 

Fermat’s Principle states that the optical path length has a stationary value; its first order 
variation or first derivative in a Taylor series expansion is zero. This means that when an 
optical path lies wholly within a medium of constant refractive index the path is a straight 
line, the shortest distance between its end points, and the light travels between these points 
in the minimum possible time. When the medium has a varying refractive index or the path 
crosses the boundary between media of different refractive indices the direction of the path 
always adjusts itself so that the time taken between its end points is a minimum. Fermat’s 
Principle is therefore sometimes known as the Principle of Least Time. Figure 1 1.3 shows 
examples of light paths in a medium of varying refractive index. As examples of light 
meeting a boundary between two media we use Fermat's Principle to derive the laws of 
reflection and refraction. 

The Laws of Reflection 

In Figure 1 1 ,4a Fermat’s Principle requires that the optical path length OSI should be a 
minimum where O is the object, S lies on the plane reflecting surface and I is the point on 
the reflected ray at which the image of O is viewed. The plane OSI must be perpendicular 
to the reflecting surface for, if reflection takes place at any other point S' on the reflecting 
surface where OSS' and ISS' are right angles then evidently OS'>OS and IS'>IS, giving 
OS 'I > OSI. 

The laws of reflection also require, in Figure 1 1 .4a that the angle of incidence i equals 
the angle of reflection r. If the coordinates of O, S and I are those shown and the velocity of 
light propagation is c then the time taken to traverse OS is 



r = U 2 + r)W 
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Rare atmosphere 




Cool Air 




Apparent 

reflecting ~ ~ ~ 

surface 



(b) 

Figure 11.3 Light takes the shortest optical path in a medium of varying refractive index, (a) A 
light ray from the sun bends towards the earth in order to shorten its path in the denser atmosphere. 
The sun remains visible after it has passed below the horizon, (b) A light ray avoids the denser 
atmosphere and the road immediately below warm air produces an apparent reflection 




(a) 




Figure 11.4 The time for light to follow the path OSI is a minimum (a) in reflection, when OSI 
forms a plane perpendicular to the reflecting surface and i = r; and (b) in refraction, when 
nsin; = n'sinr' (Snell's Law) 




The Law of Refraction 
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and the time taken to traverse SI is 

t'=[(X-x) 2 +y 2 ] l/2 /c 

so that the total time taken to travel the path OSI is 

T = t + f 



The position of S is now varied along the x axis and we seek, via Fermat's Principle of 
Least Time, that value of x which minimizes T, so that 



But 



dT 



X-x 



d* c(x 2 + y 2 )^ 2 c[(X — x) 2 + y 2 ] l ^ 2 



= 0 



(x 2 + y 2 ) ^ 2 



= sini 



and 



Hence 



and 



X-x 



[(X-x) 2 +y 2 ] 1 / 2 



= sinr 



sin i = sin r 



i = r 



The Law of Refraction 

Exactly similar arguments lead to Snell’s Law, already derived on p. 256. 

Here we express it as 

n sin i = n' sin r' 

where i is the angle of incidence in the medium of refractive index n and r' is the angle of 
refraction in the medium of refractive index n'(n' > n) . In Figure 1 1.4b a plane boundary 
separates the media and light from O (0, y) is refracted at S (x, 0) and viewed at I (X, Y) on 
the refracted ray. If v and v' are respectively the velocities of light propagation in the media 
n and n' then OS is traversed in the time 

t={x 2 +y 2y/2 /v 



and SI is traversed in the time 



t' = [(X — x ) 2 + T 2 ] 1/2 /V 
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The total time to travel from O to I is T = t + t' and we vary the position of S along the x 
axis which lies on the plane boundary between n and n', seeking that value of x which 
minimizes T. So 

dT _ 1 x _ 1 (X-x) _ 

dx v (x 2 + y 2 ) 1/2 V [(X-x) 2 + F 2 ] 1/2 

where 



and 



x 

(x 2 + y 2 ) ^ 2 



= sin i 



But 



and 



Hence 



(X-x) 

[{X-x) 2 + Y 2 ] 1/2 

1 n 

v c 

1 

v' c 

n sin i = n' sin r' 



Rays and Wavefronts 

Figure 11.2 showed the ray representation of various wavefronts. In order to reinforce the 
concept that rays trace the history of wavefronts we consider the examples of a thin lens 
and a prism. 

The Thin Lens 

In Figure 11.5a plane wave in air is incident normally on the plane face of a piano convex 
glass lens of refractive index n and thickness d at its central axis. Its spherical face has a 
radius of curvature d. The power of a lens to change the curvature of a wavefront is 
the inverse of its focal length/. A lens of positive power converges a wavefront, negative 
power diverges the wavefront. 

Simple rays optics gives the power of the piano convex lens as 

but we derive this result from first principles that is, by considering the way in which the 
lens modifies the wavefront. 




Rays and Wavefronts 
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Figure 11.5 A plane wavefront is normally incident on a plano-convex Lens of refractive index n 
and thickness d at the centraL axis. The radius of the curved surface R^> d. The wavefront is a surface 
of constant phase and the optical path Length is the same for each section of the wavefront. At a 
radius r from the centraL axis the wavefront travels a shorter distance in the denser medium and the 
Lens curves the incident wavefront which converges at a distance R/(n — 1) from the Lens 



At the central axis the wavefront takes a time t = nd /c to traverse the thickness d. At a 
distance r from the axis the lens is thinner by an amount r 2 /2R (using the elementary 
relation between the sagitta, arc and radius of a circle) so that, in the time t — nd /c, points 
on the wavefront at a distance r from the axis travel a distance 

(d - r 2 /2R) 

in the lens plus a distance [r 2 j2R + z) in air as shown in the figure. Equating the times 
taken by the two parts of the wave front we have 

nd/c = ( n/c)(d — r 2 /2R) + (1 /c){z + r 2 /2R) 

which yields 

z— (n— \)r 2 /2R 

But this is again the relation between the sagitta z, its arc and a circle of radius R/{n— 1) 
so, in three dimensions, the locus of z is a sphere of radius R / ( n — 1) and the emerging 
spherical wavefront converges to a focus at a distance 

/ - R/(n ~ 1) 



(Problems 11.1, 11.2, 11.3) 

The Prism 

In Figure 1 1.6 a section, height y, of a plane wavefront in air is deviated through an angle 0 
when it is refracted through an isosceles glass prism, base l, vertex angle 3 and refractive 
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Central 

vertical 

axis 



i 
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Figure 11.6 A plane wavefront suffers minimum deviation (0 m j n ) when its passage through a prism 
is symmetric with respect to the central vertical axis (7 = i 1 ). The wavefront obeys the OpticaL 
Helmholtz Condition that ny tan a is a constant where n is the refractive index, y is the width of the 
wavefront and a is shown. (Here a = a ') 



index n. Experiment shows that there is one , and only one, value of the incident angle i for 
which the angle of deviation is a minimum = 6 m j n . It is easily shown using ray optics that 
this unique value of i requires the passage of the wavefront through the prism to be 
symmetric about the central vertical axis as shown in the figure so that the incident angle i 
equals the emerging angle Equating the lengths of the optical paths AVA' and BB'(= nl) 
followed by the edges of the wavefront section gives the familiar result 



sin 




. P 

n sin — 
2 



which is used in the standard experiment to determine n, the refractive index of the prism. 

Now there is only one value of i which produces minimum deviation and this leads us to 
expect that the passage of the wavefront will be symmetric about the central vertical axis 
for if a plane mirror (M in the figure) is placed parallel to the emerging wavefront the 
wavefront is reflected back along its original path, and if i ^ i 1 there would be two values 
of incidence, each producing minimum deviation. At i for minimum deviation any rotation 
increases i’ . 




Ray Optics and Optical Systems 
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However, the real argument for symmetry from a wavefront point of view depends on the 
optical Helmholtz equation which we shall derive on p. 321. This states that for a plane 
wavefront the product ny tan a remains constant as it passes through an optical system 
irrespective of the local variations of the factors n , y and tan a. Now the wavefront has the 
same width on entry into and exit from the prism so y — y' and although n changes at the 
prism faces the initial and final medium for the wavefront is air where n — 1 . 

Hence, from the optical Helmholtz equation tana = tana' in Figure 11.6. It is evident 
that as long as its width y = y' the wavefront section will turn through a minimum angle 
when the physical path length BB' followed by its lower edge is a maximum with respect 
to AVA', the physical path length of its upper edge. 



Ray Optics and Optical Systems 

An optical system changes the curvature of a wavefront. It is formed by one or more optical 
surfaces separating media of different refractive indices. In Fig. 1 1.7 rays from the object 
point Lq pass through the optical system to form an image point L' . When the optical 
surfaces are spherical the line joining Lq and L', which passes through the centres of 
curvature of the surfaces, is called the optical axis. This axis cuts each optical surface at its 
pole. If the object lies in a plane normal to the optical axis its perfect image lies in a 
conjugate plane normal to the optical axis. Conjugate planes cut the optical axis at 
conjugate points, e.g. Lq and L' . In Figure 11.7 the plane at +oo has a conjugate focal 
plane cutting the optical axis at the focal point F. The plane at — oo has a conjugate focal 
plane cutting the optical axis at the focal point F'. 

Paraxial Rays 

Perfect geometrical images require perfect plane and spherical optical surfaces and in a real 
optical system a perfect spherical optical surface is obtained by using only that part of the 
wavefront close to the optical axis. This means that all angles between the axis and rays are 
very small. Such rays are called paraxial rays. 
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Figure 11.7 Optical system showing direction of incident light from left to right and opticaL 
surfaces of positive and negative curvature. Rays from L 0 pass through L' and this defines L 0 and L' 
as conjugate points. The conjugate point of F is at +oo, the conjugate point of F' is at — oo 
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Figure 11.8 Sign convention for Lengths is Cartesian measured from the right angles at 0. Angles 
take the sign of their tangents. 0 is origin of measurements 



Sign Convention 

The convention used here involves only signs of lengths and angles. The direction of 
incident light is positive and is always taken from left to right. Signs for horizontal and 
vertical directions are Cartesian. If AB = l then BA = — /. The radius of curvature of a 
surface is measured from its pole to its centre so that, in Figure 11.7, the convex surface 
presented to the incident light has a positive radius of curvature and the concave surface 
has a negative radius of curvature. 

The Cartesian convention with origin O at the right angles of Figure 11.8 gives the angle 
between a ray and the optical axis the sign of its tangent. 

If the angle between a ray and the axis is a then, for paraxial rays 

sin a — tan a = a 



so that Snell’s Law of Refraction 



n sin; = n' sin r' 



becomes 



ni = nr' 



Power of a Spherical Surface 

In Figure 1 1.9(a) and (b) a spherical surface separates media of refractive indices n and n' . 
Any ray through Lq is refracted to pass through its conjugate point L' . The angles are 
exaggerated so that the base of h is very close to the pole of the optical surface which is 
taken as the origin. In Figure 11.9(a) the signs of R, /' and a' are positive with / and a 
negative. In Figure 11.9(b) R, l, V , a and a' are all positive quantities. In both figures 
Snell’s Law gives 



ni = n'r' 




Power of a Spherical Surface 
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R, /', a', are positive 





(b) 

Figure 11.9 Spherical surface separating media of refractive indices n and n' . Rays from L 0 pass 
through L' . Snell's Law gives the power of the surface as 

„ n' n n' — n 



i.e. 

/ 1{6 — a) = n'(9 — a') 



( 11 . 1 ) 



( 11 . 2 ) 

where PP is the power of the surface. For n' > n the power 3P is positive and the surface 
converges the wavefront. For n' < n. PP is negative and the wavefront diverges. When the 
radius of curvature R is measured in metres the units of SP are dioptres. 
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Magnification by the Spherical Surface 

In Figure 11.10 the points QQ' form a conjugate pair, as do LqL' . The ray QQ' passes 
through C the centre of curvature, L () <2 is the object height y, L'Q' is the image height y' so 

ni — n'r' 



gives 

ny/l = n'y' /l' 
or 

nyh/l = n'y'h/l' 

that is 

nya — n'y'a 1 (11.3) 

This is the paraxial form of the optical Helmholtz equation. 

The Transverse Magnification is defined as 

Mj = y' jy = nV /n'l. 

The image y' is inverted so y and y' (and / and /') have opposite signs. 

The Angular Magnification is defined as 

M a = a' /a 

Note that 

Mj — n/n' M a 



which, being independent of y, applies to any point on the object so that the object in the 
plane LqQ is similar to the image in the plane L'Q' . 



Q 




Figure 11.10 Magnification by a spherical surface. The paraxial form of the optical HelmhoLtz 
equation is nya = n'y' a' so Transverse Magnification Mj = y’ /y = nf /In' Angular Magnification 
M a = a 1 /a. Note that the image is inverted so y and y' (and / and /') have opposite signs 
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A series of optical surfaces separating media of refractive indices n, n'n" yields the 
expression 



nya = n'y'a' = n"y"a" 

which is the paraxial form of the optical Helmholtz equation. 



Power of Two Optically Refracting Surfaces 

If Figure 11.11 the refracting surfaces have powers SP\ and # 2 > respectively. At the first 
surface equation (11.1) gives 

n i«i — na = 



and at the second surface 



n'a' — n\a\ = 



Adding these equations gives 

n'a' — na — 3P\h\ + ^ 2^2 



If the object is located at —00 so that a = 0 we have 



n'a' — 2?\li\ + ^V *2 



a' = 1 r ( 0> l h l +&>2h 2 ) 

n 




Figure 11.11 Two optically refracting surfaces of power 0 \ and 0*2 have a combined power of 

» = {?\h x +P 2 h 2 ) 
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This gives the same image as a single element of power 3? if 

a' =-(& l h l +0> 2 h 2 ) = 1-Phi 
n' n' 

where 

& = l-(0 , ih l +& 2 h2) (11.4) 

h\ 

is the total power of the system. This is our basic equation and we use it first to find the 
power of a thin lens in air. 



Power of a Thin Lens in Air (Figure 11.12) 

Equation (11.2) gives 



n ' n n' — n 




for each surface, so that in Figure 11.12 

3V=(*i - l)/*i 



and 

0 f 2 = (l~n 1 )/R 2 



From equation (11.4) 



with 



P = l-(0 , ihi+&2h2) 

h l 



h i = h 2 




Ri R z 



Figure 11.12 

power 



A thin Lens of refractive index n lf and radii of surface curvatures R i and R 2 has a 



3P = (n i 




1 



f 



where/' is the focal length. In the figure R\ is positive and R 2 is negative 




Power of a Thin Lens in Air (Figure 11.12) 
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we have 

0> = g> x +^ 2 

so the expression for the thin lens in air with surfaces of power 3P\ and IP 2 becomes 




where/' is the focal length. 

Applying this result to the piano convex lens of p. 31 1 we have R 1 — 00 and R 2 negative 
from our sign convention. This gives a positive power which we expect for a converging 
lens. 



Effect of Refractive Index on the Power of a Lens 

Suppose, in Figure 11.13, that the object space of the lens remains in air ( n = 1) but that 
the image space is a medium of refractive index n' 2 / 1. How does this affect the focal 
length in the medium n ' 2 ? 

If 3P is the power of the lens in air we have 



and for 



we have 



n' 2 o' — not = LPh \ 



a = 0 



(11.5) 



o' — PPh\/n' 2 = h\/n' 2 f' 

where /' is the focal length in air. 

If f 2 is the focal length in the medium n' 2 then 

/V = /n 




Figure 11.13 The focal Length of a thin lens measured in the medium n' 2 is given by f z = n' 2 f 
where/' is the focal length of the lens measured in air 
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so 

«' = h i if 2 = hi/n' 2 f 

giving 

fi = n 2 f 

Thus, the focal length changes by a factor equal to the refractive index of the medium in 
which it is measured and the power is affected by the same factor. 

If the lens has a medium no in its object space and a medium n, in its image space then 
the respective focal lengths fo and /,■ in these spaces are related by the expression 

fi/fo — — n i/ n Q (11.6) 

where the negative signs shows that/o and/, are measured in opposite directions (fo is 
negative and/, is positive). 

Principal Planes and Newton's Equation 

There are two particular planes normal to the optic axis associated with every lens element 
of an optical system. These planes are called principal planes or unit planes because 
between these planes there is unit transverse magnification so the path of every ray between 
them is parallel to the optic axis. Moreover, any complex optical system has two principal 
planes of its own. In a thin lens the principal planes coincide. 

The principal planes of a single lens do not, in general, coincide with its optical surfaces; 
focal lengths, object and image distances are measured from the principal planes and not 
from the optical surfaces. In Figure 11.14, PH and P'H' define the first and second 



First focal plane Second focal plane 

conjugate to +<*>—► •<— conjugate to 




Figure 11.14 Between the principal pLanes PH and P'H ' of a lens or Lens system there is unit 
magnification and rays between these pLanes are parallel to the optic axis. Newton's equation defines 
xx 1 =//'. The optical Helmholtz equation is nya= constant for paraxial rays and nytana = 
constant for rays from oo 




Optical Helmholtz Equation for a Conjugate Plane at Infinity 



321 



principal planes, respectively, of a lens or optical system and PF and P'F' are respectively 
the first and second focal lengths. The object and image planes cut the optic axis in Lq and 
Lq, respectively. 

The ray LH parallel to the optic axis proceeds to H' and thence through F' the second 
focal point. The rays LH and H'F' meet at H' and therefore define the position of the 
second principal plane, P'H' . The position of the first principal plane may be found in a 
similar way. 

If Figure 11.14, the similar triangles FLqL and FPQ give y/y' — x/f where, measured 
from P, only y is algebraically positive. The similar triangles F'L' 0 L' and F'P"H' give 

y/y' =f'/x', 

where, measured from P' , only y' is algebraically negative. 

We have, therefore, 

x/f =f'/x\ 

where x and /'are negative and x' and/' are positive. 

Thus, 

xx =//' 

This is known as Newton’s equation. 

If /, the object distance, and /', the image distance, are measured from the principal 
planes as in Figure 11.14, then 

l =f + x and /' = /' + x 



and Newton’s equation gives 



xx' = (/ -/)(/' -/') = W - I'f - If +//' =//' 



so that 



But from nf = —n'f (equation (11.6)) we have 

n' n n' —n 

T 7 ~l=f = T =J 

the power of the lens. 

Optical Helmholtz Equation for a Conjugate Plane at Infinity 

Suppose now that the object is no longer located at LqL but at infinity so that the ray LH 
originates at one point from the distant object while the ray LFQ comes from a point on the 
object much more distant from the optic axis. 
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We still have from triangle F'P'H' that 

y =/' tana' 



and from triangle FPQ that 



so 



y' = /tana 



But 



so 



/ tan a y 
/'tana' y' 



and 



/ 

/' 



y tana 



y'tana' 



L = zl 

f n' 



ny tan a = —n'y' tan a' 

(Note that a, a' and y' are negative.) 

This form of the Helmholtz equation applies when one of the conjugate planes is at 
infinity and is to be compared with the general unrestricted form of the Helmholtz equation 
for paraxial rays 



nya — n'y' a' 

The infinite conjugate form ny tan a = constant is valid when applied to the prism of p. 3 12 
because the plane wavefront originated at infinity. 

(Problems 11.4, 11.5, 11.6, 11.7, 11.8) 

The Deviation Method for (a) Two Lenses and (b) a Thick Lens 

Figure 11.11 illustrated how the deviation of a ray through two optically refracting surfaces 
could be used to find the power of a thin lens. We now apply this process to (a) a 
combination of two lenses and (b) a thick lens in order to find the power of these systems 
and the location of their principal planes. We have already seen in equation (11.5), which 
may be written 



n\a! — n\a — PP\y (11.7) 

where PP , is the power of the first lens in Figure 1 1.15a or the power of the first refracting 
surface in Figure 1 1 . 15b. If the incident ray is parallel to the optic axis, then a = 0 and we 
have 



n\ot' = PP \y i 



(11.8) 
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(b) 

Figure 11.15 Deviation of a ray through (a) a system of two Lenses and (b) a single thick Lens. P' is 
a principal plane of the system. All the significant optical properties may be derived via this method 



At the second lens or refracting surface 



niOLi — n\a\ 



so 



n' 2 a' 2 - n\a\ = SP 2 y 2 



(11.9) 



Equation (11.8) plus equation (11.9) gives 

n 2 a 2 = SPyyx + SP 2 yi (11.10) 

Now the incident ray strikes the principal plane P' at a height y \ so, extrapolating the ray 
from F' , the focal point of the system, through the plane P' 2 to the plane P', we have 

n' 2 a' 2 — SPy\ (11.11) 



where SP is the power of the complete system. 
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From equations (11.10) and (11.11) we have 

&y\ = &\y\ + ( 11 - 12 ) 

Moreover, Figure 11.15 shows that, algebraically 

yi=y t - da ' | 

which, with equation (11.8) gives 

yi = yi --T-^iVi = yi - d-^iyu (11.13) 

n I 

where 



= d/n\ 



This, with equation (11.12), gives 



jp = gP\ + 0> 2 - 



where gP is the power of the whole system. 
From Figure 11.15 we have algebraically 



P' 2 P' = - 



■Vi zll 

a'r, 



which with equations (11.11) and (11.13) gives 



-n' 2 d 0 > i 



P' 2 P - 



3P 



For a similar ray incident from the right we can find 



(11.14) 



(11.15) 



P X P 



n i d-P 2 



where P is the first principal plane (not shown in the figures). 

A more significant distance for the thick lens of Figure 11.15(b) is P 2 F' the distance 
between the second refracting surface and the focal point F' . 

Now 



p 2 F' = P'F' - P'P' 2 



which with 



P'F' = n' 2 jPP 



(11.16) 
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gives 


p p/ _ n 2 n 2 d33 \ 






2 3P 3P 








(11.17) 



We shall see in the following section that the factor 1 — d3P \ and the power 3? of the 
system arise quite naturally in the matrix treatment of this problem. 

The Matrix Method 

Tracing paraxial rays through an optical system involves the constant repetition of two 
consecutive processes and is particularly suited to matrix methods. 

A refracting R process carries the ray from one medium across a refracting surface into a 
second medium from where it is taken by a transmitting T process through the second 
medium to the next refracting surface for R to be repeated. Both R and T processes and 
their products are represented by 2 x 2 matrices. 

An R process is characterized by 

n'a' — na = 33 iy (11.7) 

which changes no. but which leaves y unaffected. 

We write this in the form 

a — a = 3? \y (11.18) 

where 

OLi — ftiQti 

The reader should review Figure 11.8 for the sign convention for angles. 

A T process is characterized by 

y — y — d’a’ (11.19) 

which changes y but leaves d unaffected. The thick lens of the last section demonstrates the 
method particularly well and reproduces the results we have already found. 

In Figure 11.16 note that 

n 2 a 2 = n\a\ 

that is 

d 2 = d\ 

We express equations (11.18) and (11.19) in a suitable 2x2 matrix form by writing them 
as separate pairs. 
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Figure 11.16 The single lens of Figure 11.15 is used to demonstrate the equivalence of the 
deviation and matrix methods for determining the important properties of a lens system. The matrix 
method is easily extended to a system of many optical elements 



For R we have 



a\ — a i + ^iyi 

where is the power of the first refracting surface and 



y'l = 0 «i + 1 v i 

so, in matrix form we have 



"«r 




"1 




(X. 1 


= Ri 


CM. 1 


LyiJ 




0 


1 


_yi _ 


_yi _ 



This carries the ray across the first refracting surface. 
For T we have 

6t2 — la j + Oy i 
yi = -d\a\ + ly, 

where 612 = a\, so 



«2 




' 1 


0 ‘ 




= T U 


a\ 




— 


_ 


1 _ 






.>' 2 . 




-d\ 


L.v'i J 


L/J 



This carries the ray through the lens between its two refracting surfaces. 
At the second refracting surface we repeat R to give 

a' 2 = 1 a 2 + & 2.V2 
y 2 = 0^2 + 1}’2 




1 

0 



^2 

V 2 



= R 2 



«2 

V2 



or 
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Therefore 



'<*2' 




OL2 




'a j " 




OL 1 


= r 2 


— ^12^12 


— R 2 T\ 2 R\ 


Lj 2 J 




. y 2 . 




bi J 




,y\ . 



1 ^2 

0 1 



1 

-d\ 



1 SP i 

0 1 



Oi \ 

y\ 



which, after matrix multiplication, gives 



Writing 




1 - d\ 9 > 2 
-d\ 



d\ 3 P x ?P, 
1 - d\9 1 



a i 

yi 



RiTnRi 



a 11 


a 12 


a 2 1 


a 22 



we see that a 12 is the power ^ of the thick lens (equation ( 11 . 14 )) and that a 22 apart from 
the factor n' 2 /&P is the distance between the second refracting surface and the second focal 
point. The product of the coefficient an and n \ jSP gives the separation between the first 
focal point and the first refracting surface. Note, too, that an and a 22 enable us to locate 
the principal planes with respect to the refracting surfaces. 

The order of the matrices for multiplication purposes is the reverse of the progress of the 
ray through R1T12R2, etc. 

If the ray experiences a number J of such transformations, the general result is 



~ a 'j' 


— RjTj-\,jRj-\ ■ 


■ • R2R 12R 1 


at 


[yj\ 






.- v i . 



The product of all these 2 x 2 matrices is itself a 2 x 2 matrix. 

It is important to note that the determinant of each matrix and of their products is unity, 
which implies that the column vector represents a property which is invariant in its passage 
through the system. 

The components of the column vector are, of course, a x y \ ; that is, na and y and we 
already know that for paraxial rays the Helmholtz equation states that the product nya 
remains constant throughout the system. 



(Problems 11.9, 11.10, 11.11) 



Problem 11.1 

Apply the principle of p. 3 1 1 to a thin bi-convex lens of refractive index n to show that its power is 



3 P = (n — 





where R \ and ^ 2 . the radii of curvature of the convex faces, are both much greater than the thickness 
of the lens. 
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Problem 11.2 

A plane parallel plate of glass of thickness d has a non-uniform refractive index n given 
by n = no — ar 2 where n o and a are constants and r is the distance from a certain line perpendicular 
to the sides of the plate. Show that this plate behaves as a converging lens of focal length 1 /2ad. 

Problem 11.3 

For oscillatory waves the focal point F of the converging wavefront of Figure 11.17 is located where 
Huygens secondary waves all arrive in phase: the point F' vertically above F receives waves whose 
total phase range A <f> depends on the path difference AF'-BF'. When F' is such that A<f> is 2n the 
resultant amplitude tends to zero. Thus, 




Figure 11.17 

the focus is not a point but a region whose width x depends on the wavelength A and the angle 6 
subtended by the spherical wave. If PF' is perpendicular to BF the phase at F' and P may be 
considered the same. Show that the width of the focal spot is given by x = A/sin 6. Note that sin 6 is 
directly related to the f/d ratio for a lens (focal length/diameter) so that x defines the minimum size 
of the image for a given wavelength and a given lens. 

Problem 11.4 

As an object moves closer to the eye its apparent size grows with the increasing angle it subtends at 
the eye. A healthy eye can accommodate (that is, focus) objects from infinity to about 25 cm, the 
closest ‘distance of distinct vision’. Beyond this ‘near point’ the eye can no longer focus and a 
magnifying glass is required. A healthy eye has a range of accommodation of 4 dioptres (l/oo to 
1/0.25 m). If a man’s near point is 40 cm from his eye, show that he needs spectacles of power equal 
to 1.5 dioptres. If another man is unable to focus at distances greater than 2 m, show that he needs 
diverging spectacles with a power of —0.5 dioptres. 

Problem 11.5 

Figure 11.18 shows a magnifying glass of power P with an erect and virtual image at / ' . The angular 
magnification 



M a = P/y 

angular size of image seen through the glass at distance /' 



angular size of object seen by the unaided eye at d t 
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where d a is the distance of distinct vision. Show that the transverse magnification M T = /'// where l 
is the actual distance (not d 0 ) at which the object O is held. Hence show that M a = d 0 /l and use the 
thin lens power equation, p. 318, to show that 

M a = d 0 (P+ l/T) = Pd 0 + 1 

when /' = d Q . Note that M a reduces to the value Pd Q when the eye relaxes by viewing the image at oo. 




Figure 11.18 



Problem 11.6 

A telescope resolves details of a distant object by accepting plane wavefronts from individual points 
on the object and amplifying the very small angles which separate them. In Figure 11.19, a is the 
angle between two such wavefronts one of which propagates along the optical axis. In normal 
adjustment the astronomical telescope has both object and image at oo so that the total power of the 
system is zero. Use equation (11.14) to show that the separation of the lenses must b e/ G +/ e where 
/ 0 and/ e are respectively the focal lengths of the object and eye lenses. 

If 2 y is the width of the wavefront at the objective and 2y' is the width of the wavefront at the eye 
ring show that 




Figure 11.19 
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where D is the effective diameter of the object lens and d is the effective diameter of the eye lens. 
Note that the image is inverted. 

Problem 11.7 

The two lens microscope system of Figure 1 1.20 has a short focus objective lens of power P 0 and a 
magnifying glass eyepiece of power P e . The image is formed at the near point of the eye (the 
distance d 0 of Problems 11.4 and 11.5). Show that the magnification by the object lens is 
M 0 = —P 0 x' where the minus sign shows that the image is inverted. Hence use the expression for 
the magnifying glass in Problem 11.5 to show that the total magnification is 

M = M 0 M t = -P 0 P e d 0 x' 

The length x' is called the optical tube length and is standardized for many microscopes at 0. 14 m. 




Figure 11.20 

Problem 11.8 

Microscope objectives are complex systems of more than one lens but the principle of the oil 
immersion objective is illustrated by the following problem. In Figure 11.21 the object O is 
embedded a distance R/n from the centre C of a glass sphere of radius 




Figure 11.21 
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R and refractive index n. Any ray OP entering the microscope is refracted at the surface of the sphere 
and, when projected back, will always meet the axis CO at the point I. Use Snell’s Law to show that 
the distance IC = nR. 



Problems 11.9, 11.10, 11.11 

Using the matrix method or otherwise, find the focal lengths and the location of the principal plane 
for the following lens systems (a), (b) and (c). The glass in all lenses has a refractive index of n = 1.5 
and all measurements have the same units. R , is a radius of curvature. 





Summary of Important Results 



Power of a Thin Lens 



2P — (n — 1) 



1 

R 7 




1 

7 



where n is the refractive index of the lens material, R \ and R 2 are the radii of curvature of 
the lens surfaces and / is the focal length. 



Power of two thin lenses separated a distance d in Air 

g> = 0> x + 3p i -d&\0> 1 



where 2? 1 and 2P 1 are the powers of the thin lenses. 
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Power of a thick lens of thickness d and refractive index n 

3? = 2P X + 0> 2 -d/n& l 0> 2 

where 3P X and { 2P 2 are the powers of the refracting surfaces of the lens. 



Optical Helmholtz Equation 

For a plane wavefront (source at oo) passing through an optical system the product 

ny tan a — constant 

where n is the refractive index, y is the width of the wavefront section and a is the angle 
between the optical axis and the normal to the wavefront. 

For a source at a finite distance, this equation becomes, for paraxial rays, 

nya — constant 




12 

Interference and Diffraction 



All waves display the phenomena of interference and diffraction which arise from the 
superposition of more than one wave. At each point of observation within the interference 
or diffraction pattern the phase difference between any two component waves of the same 
frequency will depend on the different paths they have followed and the resulting 
amplitude may be greater or less than that of any single component. Although we speak of 
separate waves the waves contributing to the interference and diffraction pattern must 
ultimately derive from the same single source. This avoids random phase effects from 
separate sources and guarantees coherence. However, even a single source has a finite size 
and spatial coherence of the light from different parts of the source imposes certain 
restrictions if interference effects are to be observed. This is discussed in the section on 
spatial coherence on p. 360. The superposition of waves involves the addition of two or 
more harmonic components with different phases and the basis of our approach is that laid 
down in the vector addition of Figure 1.11. More formally in the case of diffraction we 
have shown the equivalence of the Fourier transform method on p. 287 of Chapter 10. 

Interference 

Interference effects may be classified in two ways: 

1. Division of amplitude 

2. Division of wavefront 

1. Division of amplitude. Here a beam of light or ray is reflected and transmitted at a 
boundary between media of different refractive indices. The incident, reflected and 
transmitted components form separate waves and follow different optical paths. They 
interfere when they are recombined. 

2. Division of wavefront. Here the wavefront from a single source passes simultaneously 
through two or more apertures each of which contributes a wave at the point of 
superposition. Diffraction also occurs at each aperture. 
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The difference between interference and diffraction is merely one of scale: in optical 
diffraction from a narrow slit (or source) the aperture is of the order of the wavelength of 
the diffracted light. According to Huygens Principle every point on the wavefront in the 
plane of the slit may be considered as a source of secondary wavelets and the further 
development of the diffracted wave system may be obtained by superposing these wavelets. 

In the interference pattern arising from two or more such narrow slits each slit may be 
seen as the source of a single wave so the number of superposed components in the final 
interference pattern equals the number of slits (or sources). This suggests that the complete 
pattern for more than one slit will display both interference and diffraction effects and we 
shall see that this is indeed the case. 

Division of Amplitude 

First of all we consider interference effects produced by division of amplitude. In Fig- 
ure 12.1 a ray of monochromatic light of wavelength A in air is incident at an angle i on a 
plane parallel slab of material thickness t and refractive index n > 1. It suffers partial 
reflection and transmission at the upper surface, some of the transmitted light is reflected at 
the lower surface and emerges parallel to the first reflection with a phase difference 
determined by the extra optical path it has travelled in the material. These parallel beams 
meet and interfere at infinity but they may be brought to focus by a lens. Their optical path 
difference is seen to be 

«(AB + BD) - AC = 2«AB - AC 

= hit / cos 9 — 2t tan 9 sin i 

^yit 

= — — - (1 — sin 2 9) = 2 nt cos 9 
cos 9 

(because sin i — n sin 9) . 




Figure 12.1 Fringes of constant inclination. Interference fringes formed at infinity by division of 
amplitude when the material thickness t is constant. The mth order bright fringe is a circle centred at 
S and occurs for the constant 9 value in 2/rtcosd = (m + 
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This path difference introduces a phase difference 

2tt 

o = — 2 nt cos 8 
A 

but an additional phase change of 7r rad occurs at the upper surface. 

The phase difference <5 between the two interfering beams is achieved by writing the 
beam amplitudes as 

Vi = u(sincuf + 6/2) and y 2 = asin(cut — 6/2) 

with a resultant amplitude 

R — a[sin (cot + 6/2) + sin (cot — 6/2 ) 

= 2a sin tut cos <5/2 



and an intensity 



1 = R 2 = 4 a 2 sin 2 cut cos 2 6/2 

Figure 12.2 shows the familiar cos 2 6/2 intensity fringe pattern for the spatial part of I. 

Thus, if 2 nt cos 8 = mX On an integer) the two beams are anti-phase and cancel to give 
zero intensity, a minimum of interference. If 2ntcos8 = (m + \) A the amplitudes will 
reinforce to give an interference maximum. 

Since t is constant the locus of each interference fringe is determined by a constant value 
of 8 which depends on a constant angle i. This gives a circular fringe centred on S. An 
extended source produces a range of constant 8 values at one viewing position so the 
complete pattern is obviously a set of concentric circular fringes centred on S and formed 
at infinity. They are fringes of equal inclination and are called Haidinger fringes. They 
are observed to high orders of interference, that is values of m, so that t may be relatively 
large. 




Figure 12.2 Interference fringes of cos 2 intensity produced by the division of amplitude in Figure 
12.1. The phase difference S = ZimtcosO/X and m is the order of interference 
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(a) 




Figure 12.3 Fringes of constant thickness. When the thickness t of the material is not constant the 
fringes are localized where the interfering beams meet (a) in a real position and (b) in a virtual 
position. These fringes are almost paralleL to the line where t = 0 and each fringe defines a locus of 
constant t 



When the thickness t is not constant and the faces of the slab form a wedge, Figure 12.3a 
and b the interfering rays are not parallel but meet at points (real or virtual) near the wedge. 
The resulting interference fringes are localized near the wedge and are almost parallel to 
the thin end of the wedge. When observations are made at or near the normal to the wedge 
cos 9 ~ 1 and changes slowly in this region so that 2 nt cos 9 w 2 nt. The condition for bright 
fringes then becomes 

2 nt = (, m + |)A 

and each fringe locates a particular value of the thickness t of the wedge and this defines 
the patterns as fringes of equal thickness. As the value of m increases to m + 1 the thickness 
of the wedge increases by A/2 n so the fringes allow measurements to be made to within a 
fraction of a wavelength and are of great practical importance. 
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The spectral colours of a thin film of oil floating on water are fringes of constant 
thickness. Each frequency component of white light produces an interference fringe at that 
film thickness appropriate to its own particular wavelength. 

In the laboratory the most familiar fringes of constant thickness are Newton’s Rings. 

Newton's Rings 

Here the wedge of varying thickness is the air gap between two spherical surfaces of 
different curvature. A constant value of t yields a circular fringe and the pattern is one of 
concentric fringes alternately dark and bright. The simplest example. Figure 12.4, is a 
piano convex lens resting on a plane reflecting surface where the system is illuminated 
from above using a partially reflecting glass plate tilted at 45°. Each downward ray is 
partially reflected at each surface of the lens and at the plane surface. Interference takes 



Incident 

light 




Figure 12.4 Newton's rings of interference formed by an air film of varying thickness between the 
Lens and the optical flat. The fringes are circular, each fringe defining a constant value of the air film 
thickness 
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place between the light beams reflected at each surface of the air gap. At the lower (air to 
glass) surface of the gap there is a it rad phase change upon reflection and the centre of the 
interference fringe pattern, at the point of contact, is dark. Moving out from the centre, 
successive rings are light and dark as the air gap thickness increases in units of A/ 2 . If R is 
the radius of curvature of the spherical face of the lens, the thickness t of the air gap at a 
radius r from the centre is given approximately by r 2 /2R. In the n/tli order of 
interference a bright ring requires 

2 1 = ( m + j)A = r 2 /R 

and because t oc r 2 the fringes become more crowded with increasing r. Rings may be 
observed with very simple equipment and good quality apparatus can produce fringes for 
m > 100 . 

(Problem 12.1) 

Michelson's Spectral Interferometer 

This instrument can produce both types of interference fringes, that is, circular fringes of 
equal inclination at infinity and localized fringes of equal thickness. At the end of the 
nineteenth century it was one of the most important instruments for measuring the structure 
of spectral lines. 

As shown in Figure 12.5 it consists of two identical plane parallel glass plates G i and Gi 
and two highly reflecting plane mirrors Mi and Mi. Gi has a partially silvered back face, 
G2 does not. In the figure Gi and G2 are parallel and Mi and M2 are perpendicular. Slow, 
accurately monitored motion of Mi is allowed in the direction of the arrows but the 
mounting of M2 is fixed although the angle of the mirror plane may be tilted so that Mi 
and M2 are no longer perpendicular. 

The incident beam from an extended source divides at the back face of G 1 . A part of it is 
reflected back through G 1 to M 1 where it is returned through G 1 into the eye or detector. 
The remainder of the incident beam reaches M2 via G2 and returns through G2 to be 
reflected at the back face of G 1 into the eye or detector where it interferes with the beam 
from the Mi arm of the interferometer. The presence of G2 assures that each of the two 
interfering beams has the same optical path in glass. This condition is not essential for 
fringes with monochromatic light but it is required with a white light source where 
dispersion in glass becomes important. 

An observer at the detector looking into Gi will see Mi, a reflected image of M2 (M/ 
say) and the images Si and S ' 2 of the source provided by Mi and M2. This may be 
represented by the linear configuration of Figure 12.6 which shows how interference takes 
place and what type of fimges are produced. 

When the optical paths in the interferometer arms are equal and Mi and M2 are 
perpendicular the planes of M 1 and the image M , are coincident. However a small optical 
path difference t between the arms becomes a difference of 2 1 between the mirrored images 
of the source as shown in Figure 12 . 6 . The divided ray from a single point P on the 
extended source is reflected at M| and M2 (shown as M 2 ) but these reflections appear to 




Michelson’s Spectral Interferometer 



339 




Eye or detector 



Figure 12.5 Michelson's Spectral. Interferometer. The beam from source S splits at the back face of 
Gi, and the two parts are reflected at mirrors Mi and M 2 to recombine and interfere at the eye or 
detector. G 2 is not necessary with monochromatic light but is required to produce fringes when S is a 
white light source 



come from P i and P' 2 in the image planes of the mirrors. The path difference between the 
rays from Pi and P 2 is evidently 2t cos 9. When 2t cos 6 = mX a maximum of interference 
occurs and for constant 6 the interference fringe is a circle. The extended source produces a 
range of constant 6 values and a pattern of concentric circular fringes of constant 
inclination. 

If the path difference t is very small and the plane of Mi is now tilted, a wedge is formed 
and straight localized fringes may be observed at the narrowest part of the wedge. As the 
wedge thickens the fringes begin to curve because the path difference becomes more 
strongly dependent upon the angle of observation. These curved fringes are always convex 
towards the thin end of the wedge. 
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Figure 12.6 Linear configuration to show fringe formation by a Michelson interferometer. A ray 
from point P on the extended source S reflects at Mi, and appears to come from Pi in the reflected 
plane Si. The ray is reflected from M 2 (shown here as M'f) and appears to come from P' 2 in the 
reflected plane S' 2 . The path difference at the detector between the interfering beams is effectively 
2tcos# where t is the difference between the path lengths from the source S to the separate mirrors 
Mi and M 2 



The Structure of Spectral Lines 



The discussion on spatial coherence, p. 362, will show that two close identical sources 
emitting the same wavelength A produce interference fringe systems slightly displaced 
from each other (Figure 12.17). 

The same effect is produced by a single source, such as sodium, emitting two 
wavelengths, A and A — A A so that the maxima and minima of the cos 2 fringes for A are 
slightly displaced from those for A — AA. This displacement increases with the order of 
interference m until a value of m is reached when the maximum for A coincides with a 
minimum for A — AA and the fringes disappear as their visibility is reduced to zero. 

In 1862, Fizeau, using a sodium source to produce Newton’s Rings, found that the 
fringes disappeared at the order m = 490 but returned to maximum visibility at m — 980. 
He correctly identified the presence of two components in the spectral line. 

The visibility 



(^max fmin)/ (-^max T ^min) 



equals zero when 



mX = (, m + j)(A — AA) 

and for A = 0.5893 pm and m = 490 we have AA = 0.0006 pm (6 A), which are the 
accepted values for the D lines of the sodium doublet. 

Using his spectral interferometer, Michelson extended this work between the years 1 890 
and 1900, plotting the visibility of various fringe systems and building a mechanical 
harmonic analyser into which he fed different component frequencies in an attempt to 
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reproduce his visibility curves. The sodium doublet with angular frequency components u> 
and oj + Ac o produced a visibility curve similar to that of Figures 1 .7 and 4.4 and was easy 
to interpret. More complicated visibility patterns were not easy to reproduce and the 
modern method of Fourier transform spectroscopy reverses the procedure by extracting the 
frequency components from the observed pattern. 

Michelson did however confirm that the cadmium red line, A — 0.6438 pm was highly 
monochromatic. The visibility had still to reach a minimum when the path difference in his 
interferometer arms was 0.2 m. 



Fabry - Perot Interferometer 

The interference fringes produced by division of amplitude which we have discussed so far 
have been observed as reflected light and have been produced by only two interfering 
beams. We now consider fringes which are observed in transmission and which require 
multiple reflections. They are fringes of constant inclination formed in a pattern of 
concentric circles by the Fabry-Perot interferometer. The fringes are particularly narrow 
and sharply defined so that a beam consisting of two wavelengths A and A — AA forms two 
patterns of rings which are easily separated for small AA. This instrument therefore has an 
extremely high resolving power. The main component of the interferometer is an etalon 
Figure 12.7 which consists of two plane parallel glass plates with identical highly reflecting 
inner surfaces S i and S 2 which are separated by a distance d. 

Suppose a monochromatic beam of unit amplitude, angular frequency co and wavelength 
(in air) of A strikes the surface S 1 as shown. A fraction t of this beam is transmitted in 
passing from glass to air. At S 2 a further fraction t' is transmitted in passing from air to 
glass to give an emerging beam of amplitude tt' = T. The reflection coefficient at the air- 
Si and air-S 2 surfaces is r so each subsequent emerging beam is parallel but has an 
amplitude factor r 2 = R with respect to its predecessor. Other reflection and transmission 
losses are common to all beams and do not affect the analysis. Each emerging beam has a 
phase lag 6 = 4ird cos 6 / A with respect to its predecessor and these parallel beams interfere 
when they are brought to focus via a lens. 

The vector sum of the transmitted interfering amplitudes together with their appropriate 
phases may be written 



A = re“' + TRe^-V + 77? 2 e i(uJ ^ 25) . . . 
= T e iwf [l + Re~ iS + R 2 e~ i2S . . . 

which is an infinite geometric progression with the sum 

A = 7V“ '/(l — Re~ i6 ) 



This has a complex conjugate 



A* = Te~ iu> 7(1 -Re iS ) 
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Figure 12.7 Si and S 2 are the highly reflecting inner surfaces of a Fabry--Perot etalon with a 
constant air gap thickness d. Multiple reflections produce parallel interfering beams with amplitudes 
T, RT r R Z T, etc. each beam having a phase difference 

S = 47 rd cos 9/ A 

with respect to its neighbour 



If the incident unit intensity is I o the fraction of this intensity in the transmitted beam may 
be written 

f _AA* _ T 2 _ T 2 

To "To” ~~ (1 -Re~ i6 )( 1 -Re iS ) ~ (1 + R 2 - 2Rcos6) 

or, with 



cos 6 = 1—2 sin 2 6/2 
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Figure 12.8 Observed intensity of fringes produced by a Fabry-Perot interferometer. Transmitted 
intensity I t versus 6. R = r 2 where r is the reflection coefficient of the inner surfaces of the etalon. 
As R increases the fringes become narrower and more sharply defined 



as 



I t _ T 2 T 2 1 

Io ~ (1 — R) 2 + 4R sin 2 8/2 ~ (1 - R) 2 1 + [4R sin 2 8/2/{\ - R) 2 } 

But the factor T 2 /( 1 — R)~ is a constant, written C so 

I t 1 

_ — c 

I o l + [4Rsin 2 6/2/(l-R) 2 ] 

Writing Cl o = / max, the graph of / t versus 6 in Figure 12.8 shows that as the reflection 
coefficient of the inner surfaces is increased, the interference fringes become narrow and 
more sharply defined. Values of R > 0.9 may be reached using the special techniques of 
multilayer dielectric coating. In one of these techniques a glass plate is coated with 
alternate layers of high and low refractive index materials so that each boundary presents a 
large change of refractive index and hence a large reflection. If the optical thickness of 
each layer is A/4 the emerging beams are all in phase and the reflected intensity is high. 



Resolving Power of the Fabry - Perot Interferometer 

Figure 12.8 shows that a value of R = 0.9 produces such narrow and sharply defined 
fringes that if the incident beam has two components A and A — AA the two sets of fringes 
should be easily separated. The criterion for separation depends on the shape of the fringes: 




344 



Interference and Diffraction 




Order 
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m + Am 


m + 1 


Order 


Phase 
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Figure 12.9 Fabry-Perot interference fringes for two wavelength A and A — AA are resolved at 
order m when they cross at half their maximum intensity. Moving from order m to m + 1 changes the 
phase S by 2<t rad and the full 'half-value' width of each maximum is given by Am = 26-1/2 which is 
also the separation between the maxima of A and A — AA when the fringes are just resolved 



the diffraction grating of p. 373 uses the Rayleigh criterion, but the fringes here are so 
sharp that they are resolved at a much smaller separation than that required by Rayleigh. 

Here the fringes of the two wavelengths may be resolved when they cross at half their 
maximum intensities; that is, at 7 t = / max /2 in Figure 12.9. 

Using the expression 

It ^max ' “ ; t „ , - 

47? sin- 6/2 
1 H 

(1 -R) 

we see that 7 t = 7 max when 6 = 0 and 7 t = 7 max /2 when the factor 

47? sin 2 6/2/ {l - R) 2 = 1 

The fringes are so narrow that they are visible only for very small values of 6 so we may 
replace sin 6/2 by 6/2 in the expression 



47? sin 2 6/2/{\ -7?) 2 = 1 
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to give the value 



< 5 1/2 — 



(1 

R 1 / 2 



as the phase departure from the maximum, 5 = 0, which produces the intensity /, = 7 max / 2 
for wavelength A. Our criterion for resolution means, therefore, that the maximum intensity 
for A — AA is removed an extra amount 8\n along the phase axis of Figure 12.9. This axis 
also shows the order of interference m at which the wavelengths are resolved, together with 
the order m + 1 which represents a phase shift of 5 = 27 r along the phase axis. 

In the mth order of interference we have 

2d cos 6 = mX 

and for fringes of equal inclination ( 6 constant), logarithmic differentiation gives 

A/AA = —m/Am 

Now Am = 1 represents a phase change of S = 2n and the phase difference of 2.6\n 
which just resolves the two wavelengths corresponds to a change of order 

Am = 2.5i/2/2tt 



Thus, the resolving power, defined as 



A 

AA 



m 

Am 



mir mirR 1 / 2 

V = ( J -ft) 



The equivalent expression for the resolving power in the mth order for a diffracting 
grating of N lines (interfering beams) is shown on p. 376 to be 



A 

AA 



= mN 



so we may express 



Aft = 7tR 1/2 /( 1 -R) 



as the effective number of interfering beams in the Fabry-Perot interferometer. 

This quantity Aft is called the finesse of the etalon and is a measure of its quality. We see that 






26 



1/2 



1 

Am 



separation between orders m and m + 1 
‘half value’ width of mth order 



Thus, using one wavelength only, the ratio of the separation between successive fringes to 
the narrowness of each fringe measures the quality of the etalon. A typical value of N' ~ 30. 



Free Spectral Range 

There is a limit to the wavelength difference AA which can be resolved with the Fabry- 
Perot interferometer. This limit is reached when AA is such that the circular fringe for A in 
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the with order coincides with that for A — AA in the m + 1th order. The pattern then loses its 
unique definition and this value of AA is called the free spectral range. 

From the preceding section we have the expression 

A m 
AA Am 

and in the limit when AA represents the free spectral range then 

Am = 1 

and 

AA = —A/m 

But mX = 2d when 0 ~ 0 so the free spectral range 

AA = -A 2 /2 d 

Typically d ~ 1 0 2 m and for A (cadmium red) = 0.6438 microns we have, from 2d = mX, 
a value of 

m ~ 3 x 10 4 

Now the resolving power 



so, for 



the resolving power can be as high as 1 part in 10 6 . 

Central Spot Scanning 

Early interferometers recorded flux densities on photographic plates but the non-linear 
response of such a technique made accurate resolution between two wavelengths tedious 
and more difficult. This method has now been superseded by the use of photoelectronic 
detectors which have the advantage of a superior and more reliable linearity. Moreover, the 
response of such a device with controlled vibration of one mirror of the etalon allows the 
variation of the intensity across the free spectral range to be monitored continuously. 

The vibration of the mirror, originally electro-mechanical, is now most often produced 
by using a piezoelectric material on which to mount one of the etalon mirrors. When a 
voltage is applied to such a material it changes its length and the distance d between the 
etalon mirrors can be varied. The voltage across the piezoelectric mount is tailored to 
produce the desired motion. 

Changing d by A/2 is equivalent to changing Am by 1, which corresponds to a scan of 
the free spectral range, AA, when A/AA = |m/Am| (Figure 12.9). 



— = mN' 
AA 



N' « 30 
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E = Etalon P = Pinhole 

S = Source D = Detector 

Sc = Screen 




Figure 12.10 Fabry-Perot etalon central spot scanning. The distance between the etalon mirrors 
changes when one mirror vibrates on its piezoelectric mount. The free spectral range is scanned 
over many vibration cycles at a central spot and a stationary trace is obtained on the oscilloscope 
screen 



One of the most common experimental arrangements is that of central spot scanning 
(Figure 12.10). Where the earlier photographic technique recorded the flux density over a 
wide region for a short period, central spot scanning focuses on a single point in space for a 
long period over many cycles of the etalon vibration. Matching the time base of the 
oscilloscope to the vibration period of the etalon produces a stationary trace on the screen 
which can be measured directly in addition to being filmed for record purposes. 



The Laser Cavity 

The laser cavity is in effect an extended Fabry-Perot etalon. Mirrors coated with multi- 
dielectric films described in the next section can produce reflection coefficients R « 0.99 
and the amplified stimulated emission in the laser produces a beam which is continuously 
reflected between the mirror ends of the cavity. The high value of R allows the amplitudes 
of the beam in opposing directions to be taken as equal, so a standing wave system is 
generated (Figure 12.11) to form a longitudinal mode in the cavity. 

The superposed amplitudes after a return journey from one mirror to the other and back 
are written for a wave number k and a frequency ui = 2nv as 

£ — A [ (gi {Ut-kx) _ £ i (ut+kx)^ 

= A i ( Q-' lkx - e ifa ) e iwf = -2iA , sin kx e iut 
of which the real part is E = 2A \ sin kx sin ujt. 
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Figure 12.11 A Longitudinal mode in a Laser cavity which behaves as an extended Fabry-Perot 
etalon with highly reflecting mirrors at each end. The standing wave system acquires an extra A/2 for 
unit change in the mode number m. A typical output is shown in Figure 12.12 



If the cavity length is L, one round trip between the mirrors creates a phase change of 



<f> = —2 Lk + 2a = 



47 tL 



v + 2a 



c 



where a is the phase change on reflection at each mirror. 

For this standing wave mode to be maintained, the phase change must be a multiple of 
27 t, so for m an integer 

AttL 

<p = m2n = v — 2a 

c 



or 



me ac 

2L + 2vrZ 



When m changes to m + 1, the phase change of 27 t corresponds to an extra wavelength A 
for the return journey; that is, an extra A/2 in the standing wave mode. A series of 
longitudinal modes can therefore exist with frequency intervals Av = c/2L determined by 
a unit change in m. 

The intensity profile for each mode and the separation Av is best seen by reference to 
Figure 12.9, where f = 6 is given by the unit change in the order of interference from m to 
m T 1 • 

The intensity profile for each cavity mode is that of Figure 12.9, where the full width at 
half maximum intensity is given by the phase change 



26 l /2 — 



R 1 / 2 



where R is the reflection coefficient. This corresponds to a full width intensity change over 
a frequency dv generated by the phase change 

47tL 

dej) = dv in the expression for <i> above 
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The width at half maximum intensity for each longitudinal mode is therefore given by 



4? tL , 2(1 -R) 

do = — 

c R l ! 2 



or 



do = 



(1 -R)° 

R l / 2 2irL 



For a laser 1 m long with R = 0.99, the longitudinal modes are separated by frequency 
intervals 



Ao — — — 1.5 x 10 8 Hz 
2 L 

Each mode intensity profile has a full width at half maximum of 

do = 10~ 2 — w 4.5 x 10 5 Hz 
2tt 

For a He-Ne laser the mean frequency of the output at 632.8 nm is 4.74 x 10 14 Hz. The 
pattern for Ao and do is shown in Figure 12.12, where the intensities are reduced under the 
dotted envelope as the frequency difference from the mean is increased. 

The finesse of the laser cavity is given by 



A o 1.5 x 10 8 
~do~ 4.5 x 10 5 



300 



for the example quoted. 





f 
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dv 
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Figure 12.12 Output of a Laser cavity. A series of Longitudinal modes separated by frequency 
intervals A o = c/2L, where c is the velocity of light and L is the cavity length. The modes are 
centred about the mean output frequency and are modulated under the dotted envelope. For a He-Ne 
laser 1 m long the separation A v between the modes « 300 fuLL widths of a mode intensity profile at 
half its maximum value 
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The intensity of each longitudinal mode is of course, amplified by each passage of the 
stimulated emission. Radiation allowed from out of one end represents the laser output but 
the amplification process is dominant and the laser produces a continuous beam. 

Multilayer Dielectric Films 

We have just seen that in the /nth order of interference the resolving power of a Fabry- 
Perot interferometer is given by 

A/AA = mN' 

where the finesse or number of interfering beams 

N' = 7 tR 1/2 /( 1 - R) = 7rr/( 1 - r 2 ) 

and r is the reflection coefficient of the inner surfaces of the etalon. 

It is evident that as /■ — > 1 the values of N' and the resolving power become much larger. 
The value of r can be increased to more than 99% by using a metallic coating on the inner 
surfaces of the etalon or by depositing on them a multilayer of dielectric films with 
alternating high and low refractive indices. For a given monochromatic electromagnetic 
wave each layer or film has an optical thickness of A/4. 

The reflection coefficient r for such a wave incident on the surface of a higher refractive 
index film is increased because the externally and internally reflected waves are in phase; a 
phase change of 7r occurs only on the outer surface and is reinforced by the 7r phase change 
of the wave reflected at the inner surface which travels an extra A/2 optical distance. 

High values of r result from films of alternating high and low values of the refractive 
index because reflections from successive boundaries are in phase on return to the front 
surface of the first film. Those retarded an odd multiple of 7r by the extra optical path length 
per film also have a 7r phase change on reflection to make a total of 27 t rad. 

We consider the simplest case of a monochromatic electromagnetic wave in a medium of 
refractive index n i , normally incident on a single film of refractive index n \ , and thickness 
d\ | . This film is deposited on the surface of a material of refractive index n' 2 , which is called 
a substrate (Figure 12.13). The phase lag for a single journey across the film is written 6. 

The boundary conditions are that the components of the E and H fields parallel to a 
surface are continuous across that surface. We write these field amplitudes as Ef and 
Hf = nEf for the forward-going wave to the right in Figure 12.13 and E r and H r = nE r for 
the reflected wave going to the left. 

We see that at surface 1 the boundary conditions for the electric field E are 

E n +E rl =E' n +E' rl (12.1a) 

and for the magnetic field 

n \ Ef\ — n\E r \ = n\E /j — n[E' rl (12.1b) 

where the negative sign for the reflected amplitude arises when the E x H direction of the 
wave is reversed (see Figure 8.7). 
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Film Substrate 



Eh 



Eh 



Er 1 



E'r 1 



d't ' 

n\ 



no 



Figure 12.13 A thin dielectric film is deposited on a substrate base. At each surface an 
electromagnetic wave is normally incident, as Efj, in a medium of refractive index n,- and is reflected 
as Er,-. A multilayer stack of such films, each of optical thickness A/4 and of alternating high and low 
refractive indices can produce reflection coefficients >99% 



At surface 2 in Figure 12.13, EL arrives with a phase lag of 6 with respect to E'p at 
surface 1 but the E' rl wave at surface 2 has a phase 6 in advance of E' rl at surface 1, so we 
have the boundary conditions 



and 



E n*- iS 



/ p' e ~ iS 
1 1 c /i c 



' E' r i ~ E'f 



72 



, I rpf ~i6 



We can eliminate EL and E' rX from equations ( 12. la)— ( 12. Id) to give 



E r i 

Efi 





(12.1c) 


rr / 
J /2 


(12. Id) 


(12. Id) to give 




E'P- 

Efi 


(12.2) 



and 



E' 



! i — n i — (f n j sin 6 + n' 2 cos 6) 



-n 






(12.3) 



which we can express in matrix form 



1 " 


1 


i 


£,-i _ 


cos 6 


isin h/n) 


1 


n t 


“h 


~ n l _ 


£/i 


in'j sin <5 


cos h 


_ n 2_ 
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We write this as 



' 1 ' 




1 


r — Mi 


1 


+ 


n 1 




-n 1 _ 


L«2 J 



where r = E r \ jEj\ is the reflection coefficient at the first surface and t = Ej 2 / £/ 1 is the 
transmitted coefficient into medium n' 2 (a quantity we shall not evaluate). 

The 2x2 matrix 



M i = 



cos 6 
in\ sin <5 



isin S/n\ 
cos 6 



relates r and t across the first film and is repeated with appropriate values of /?' for each 
successive film. The product of these 2x2 matrices is itself a 2 x 2 matrix as with the 
repetitive process we found in the optical case of p. 325. 

Thus, for N films we have 



1 

n i 



1 

-n\ 



R — Ad \ Ad 2 Ad 3 ■ ■ ■ M n 



l N + 1 



(12.4) 



where R = E rl /Ef\ as before and T = E' f ^ N+ ^/Efi the transmitted coefficient 
after the final film. Note, however, that E r \ in R is now the result of reflection from all the 
film surfaces and that these are in phase. 

The typical matrix M 3 relates r to t across the third film and the product of the matrices 



Ad \AdiAd 3 ■ ■ ■ M n — Ad — 



M n 
M2 1 



M 12 

Ad 22 



is a 2x2 matrix. 

Eliminating T from the two simultaneous equations (12.4) we have, in terms of the 
coefficients of Ad 



where 



R = 



A-B 
A + B 



(12.5) 



A = ru(Adn + M i2ti' N+1 ) 



and 



B = (M 21 + M22n' N+ i) 



If we now consider a system of two films, the first of higher refractive index riu and the 
second of lower refractive index hl, where each has an optical thickness d = A/4, then the 
phase S — tt/2 for each film and 



0 


i/n H 


' 0 


i/n L 




-n L /n H 


0 


_in H 


0 


in L 


0 




0 


- n H /n L 
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A stack of N such pairs, 2 N films in all with alternating n u and n /, produces 



M\M 2 ■ ■ ■ M 2 n — [MiM 2 ] N — 



giving R the total reflection coefficient from equation (12.5) equal to 





We see that as long as tin f 1 «z., then as N — > oo, R —> 1 and this value may be used in 
our derivation of the expressions for the finesse and resolving power of the Fabry-Perot 
interferometer. 

Multilayer stacks using zinc sulphate (n H = 2.3) and cryolite (n L = 1.35) have achieved 
R values > 99.5%. 

Note that all the 2x2 matrices and their products have determinants equal to unity which 
states that the column vectors represent a quantity which remains invariant throughout the 
matrix transformations. 

(Problem 12.2) 

The Thin Film Optical Wave Guide 

Figure 12.14 shows a thin film of width d and refractive index n along which light of 
frequency v and wave number k is guided by multiple internal reflections. The extent of the 



P n' 




Figure 12.14 A thin dielectric film or fibre acts as an optical wave guide. The reflection angle 9 
must satisfy the relation n sin 9 > n 1 , where n' is the refractive index of the coating over the film of 
refractive index n. Propagating modes have standing wave systems across the film as shown and 
constructive interference occurs on the standing wave axis where the amplitude is a maximum. 
Destructive interference occurs at the nodes 
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wave guide is infinite in the direction normal to the page. The internal reflection angle 0 
must satisfy 

n sin 9 >n' 

where n ' is the refractive index of the medium bounding the thin film surfaces. Each 
reflected ray is normal to a number of wave fronts of constant phase separated by A, where 
k = 2n/X and constructive interference is necessary for any mode to propagate. Reflections 
may take place at any pair of points P and O along the film and we examine the condition 
for constructive interference by considering the phase difference along the path POQ, 
taking into account a phase difference a introduced by reflection at each of P and Q. 
Now 

PO = cos 9/d 

and 



OQ = PO cos 29 

so with 



cos 2 9 = 2 cos 2 9—1 



we have 



PO + OQ = 2d cos 9 



giving a phase difference 



Af — 4>q — 4>p — 



2i TV 
c 



(n 2d cos 9) + 2a 



Constructive interference requires 



A <t> = m 2i r 



where m is an integer, so we write 



2i TV 

m 2 tt = n 2d cos 9 — 2irAm 

c 



where 



Am = 2a/2-7r 



represents the phase change on reflection. 
Radiation will therefore propagate only when 



cos 9 = 



c(rn + Am) 
v 2nd 



for m = 0, 1 , 2, 3. 
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The condition n sin 9 > n' restricts the values of the frequency v which can propagate. If 
9 — 9 m for mode m and 



cos 9 m — (1 — sin 2 9 m ) l ^ 2 



then 



becomes 



and v must satisfy 



n sin 9 m > n 



cos 9 m < 




v> 



c(m + Am) 

2 d(n 2 — n' 2 ) ^ 2 



The mode m = 0 is the mode below which v will not propagate, while Am is a constant 
for a given wave guide. Each mode. Figure 12.14, is represented by a standing wave system 
across the wave guide normal to the direction of propagation. Constructive interference 
occurs on the axis of this wave system where the amplitude is a maximum and destructive 
interference is indicated by the nodes. 



Division of Wavefront 



Interference Between Waves from Two Slits or Sources 

In Figure 12.15 let Si and S 2 be two equal sources separated by a distance f each 
generating a wave of angular frequency u> and amplitude a. At a point P sufficiently distant 
from S 1 and S 2 only plane wavefronts arrive with displacements 



and 



y 1 = a sin (ujt — kx \ ) from S 1 



j 2 = a sin (cut — kxf) from S 2 

so that the phase difference between the two signals at P is given by 

2tt 

8 — k(x 2 — x\) — —{x 2 - x\) 

A 

This phase difference 8, which arises from the path difference X 2 - x 1 , depends only on x 1 , 
X 2 and the wavelength A and not on any variation in the source behaviour. This requires that 
there shall be no sudden changes of phase in the signal generated at either source - such 
sources are called coherent. 
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Figure 12.15 Interference at P between waves from equal sources Si and S 2 , separation/, depends 
only on the path difference x 2 — X\. Loci of points with constant phase difference 6= ( 27 r/A) 
(x 2 — Xi) are the family of hyperbolas with Sj and S 2 as foci 



The superposition of displacements at P gives a resultant 

R = y i +y 2 = «[sin ( urt — kx\) + sin (cot — kxf)} 

Writing X= (x\ + xi) /2 as the average distance from the two sources to point P we 
obtain 

kx\=kX — 8/2 and kx 2 = kX + 8/2 



to give 



R = n[sin (urt — kX + 8/2) + sin (urt — kX — 8/2)] 
= 2 a sin (urt — kX) cos 8/2 



and an intensity 

I = R 2 = 4 a 2 sin 2 (urt — kX) cos 2 8/2 
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When 



the spatial intensity is a maximum. 



cos - = ±1 
2 



I = 4a- 



and the component displacements reinforce each other to give constructive interference. 
This occurs when 

8 IT 

-= - (x 2 -Xi) = /ITT 

that is, when the path difference 



X2 — X\ — nX 



When 



8 

cos - = 
2 



0 



the intensity is zero and the components cancel to give destructive interference. This 
requires that 

8 , 7T 7T , 

-= (2/1+ l)- = -(*2-Xl) 

or, the path difference 



x 2 ~xi = (n + j)A 



The loci or sets of points for which x 2 — x\ (or 8) is constant are shown in Figure 12.15 to 
form hyperbolas about the foci S i and S 2 (in three dimensions the loci would be the 
hyperbolic surfaces of revolution). 



Interference from Two Equal Sources of Separation / 

Separation f > A. Young's Slit Experiment 

One of the best known methods for producing optical interference effects is the Young’s slit 
experiment. Here the two coherent sources. Figure 12.16, are two identical slits S i and St 
illuminated by a monochromatic wave system from a single source equidistant from S i and 
S 2 - The observation point P lies on a screen which is set at a distance l from the plane of 
the slits. 

The intensity at P is given by 

I = R 2 = 4 a 2 cos 2 - 
2 
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Figure 12.16 Waves from equal sources Si and S 2 interfere at P with phase difference 6 = (27t/A) 
(. x 2 — Xi) = (27r/A)/sin 9 w (2n/X)f(z/l). The distance l 3> z and/so SiP and S 2 P are effectively 
paralLel. Interference fringes of intensity I = 1 0 cos 2 5/2 are formed in the plane PP 0 



and the distances PP () — z and slit separation / are both very much less than / (experi- 
mentally « 1 0 3 /). This is indicated by the break in the lines x\ and xi in Figure 12.16 
where S jP and S 2 P may be considered as sufficiently parallel for the path difference to be 
written as 

X2-X1 =/sin 0 =/y 

to a very close approximation. 

Thus 



If 



<5 = 



2-7T 

T 



{X2 ~Xl) 



27T 

T 



/sin 0 




z 

7 



then 



1 — 4 a 2 cos 2 



6 

2 



I = I Q = 4a 2 



when 



6 

cos - 
2 



1 



that is, when the path difference 

f - — 0, ±A, ±2A, . . . ± n\ 
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and 



7 = 0 when cos - = 0 

2 

that is, when 

4 4 4' ±( " + « A 

Taking the point Po as z — 0, the variation of intensity with z on the screen PoP will be 
that of Figure 12.16, a series of alternating straight bright and dark fringes parallel to the 
slit directions, the bright fringes having 7 = 4a 2 whenever z = nXl/f and the dark fringes 
7 = 0, occurring when z — (n + \)Xl/f, n being called the order of interference of the 
fringes. The zero order n = 0 at the point Po is the central bright fringe. The distance on the 
screen between two bright fringes of orders n and n + 1 is given by 

, , ,, , XI XI 

Z n + 1 - Z n = [(« + 1 ) - n\ — = — 

which is also the physical separation between two consecutive dark fringes. The spacing 
between the fringes is therefore constant and independent of n, and a measurement of the 
spacing, 1 and / determines A. 

The intensity distribution curve (Figure 12.17) shows that when the two wave trains 
arrive at P exactly out of phase they interfere destructively and the resulting intensity or 
energy flux is zero. Energy conservation requires that the energy must be redistributed, and 
that lost at zero intensity is found in the intensity peaks. The average value of cos 2 6/2 is j, 
and the dotted line at 7 = 2a 2 is the average intensity value over the interference system 
which is equal to the sum of the separate intensities from each slit. 

There are two important points to remember about the intensity interference fringes 
when discussing diffraction phenomena; these are 

• The intensity varies with cos 2 6 / 2 . 

• The maxima occur for path differences of zero or integral numbers of the wavelength, 
whilst the minima represent path differences of odd numbers of the half-wavelength. 




4a 2 
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Figure 12.17 Intensity of interference fringes is proportionaL to cos 2 5/2, where 6 is the phase 
difference between the interfering waves. The energy which is lost in destructive interference 
(minima) is redistributed into regions of constructive interference (maxima) 
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Figure 12.18 The point source A produces the cos 2 interference fringes represented by the solid 
curve A'C'. Other points on the line source AB produce cos 2 fringes (the displaced broken curves B') 
and the observed total intensity is the curve DE. When the points on AB extend A'B' to C the fringes 
disappear and the fieLd is uniformly illuminated 



Spatial Coherence In the preceding section nothing has been said about the size of the 
source producing the plane wave which falls on S i and S 2 . If this source is an ideal point 
source A equidistant from Si and S 2 , Figure 12.18, then a single set of cos 2 fringes is 
produced. But every source has a finite size, given by AB in Figure 12.18, and each point 
on the line source AB produces its own set of interference fringes in the plane PPo; the eye 
observing the sum of their intensities. 

If the solid curve A'C' is the intensity distribution for the point A of the source and the 
broken curves up to B' represent the corresponding fringes for points along AB the 
resulting intensity curve is DE. Unless A'B' extends to C the variations of DE will be seen 
as faint interference bands. These intensity variations were quantified by Michelson, who 
defined the 



Visibility 



I max I n 
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The cos 2 fringes from a point source obviously have a visibility of unity because the 
minimum intensity / mm — 0. 

When A ' B ' of Figure 12.18 = A'C, the point source fringe separation (or a multiple of 
it) the field is uniformly illuminated, fringe visibility = 0 and the fringes disappear. 

This occurs when the path difference 

AS 2 — BSi « AB sin 7 = A/2 where AS 2 = AS i. 

Thus, the requirement for fringes of good visibility imposes a limit on the finite size of the 
source. Light from points on the source must be spatially coherent in the sense that 
AB sin7 <C A/2 in Figure 12.18. 

But for f -C d. 



sin 7 K,f/2d 



so the coherence condition becomes 

sin 7 = f /Id < A/2AB 
or 

AB/dCX/f 

where AB Id measures the angle subtended by the source at the plane S i S 2 • 

Spatial coherence therefore requires that the angle subtended by the source 

«A // 

where / is the linear size of the diffracting system. (Note also that A// measures B( z/l) 
the angular separation of the fringes in Figure 12.16.) 

As an example of spatial coherence we may consider the production of Young's 
interference fringes using the sun as a source. 

The sun subtends an angle of 0.018 rad at the earth and if we accept the approximation 

AB A_ A 

with A = 0.5 pm , 
we have 

0.5 

J 4(0.018) H 

This small value of slit separation is required to meet the spatial coherence condition. 
Separation f <C A (kf <C 1 where k = 2 tt/X) 

If there is a zero phase difference between the signals leaving the sources S 1 and S 2 of 
Figure 12.16 then the intensity at some distant point P may be written 
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where the path difference S 2 P — SjP = fsin9 and I s = a 2 is the intensity from each 
source. 

We note that, since/ <C A (kf <C 1), the intensity has a very small 9 dependence and the 
two sources may be effectively replaced by a single source of amplitude 2 a. 



Dipoie Radiation (/ < A) 



Suppose, however, that the signals leaving the sources S 1 and S 2 are anti-phase so that their 
total phase difference at some distant point P is 

<5 = (<*>o + kf sin 9) 



where 60 = vr is the phase difference introduced at source. 

The intensity at P is given by 

, 6 , / 7T k f sin 9 

I = 4 7 S cos 2 - = 4 I s cos 2 — I 



= 4 h shr 



I s (kf sin #)' 



kf sin 9 



because 

kf < 1 

Two anti-phase sources of this kind constitute a dipole whose radiation intensity / <C / s 
the radiation from a single source, when kf <C 1 . The efficiency of radiation is seen to 
depend on the product kf and, for a fixed separation / the dipole is a less efficient radiator at 
low frequencies (small k) than at higher frequencies. Figure 12.19 shows the radiation 
intensity I plotted against the polar angle 9 and we see that for the dipole axis along the 
direction 9 = tt / 2, completely destructive interference occurs only on the perpendicular 
axis 9 = 0 and 9 = tt. There is no direction (value of 9) giving completely constructive 
interference. The highest value of the radiated intensity occurs along the axis 9 = 7t/2 and 
9 = 37t/2 but even this is only 

I=(kf) 2 h , 

where 

kf < 1 

The directional properties of a radiating dipole are incorporated in the design of 
transmitting aerials. In acoustics a loudspeaker may be considered as a multi dipole source, 
the face of the loudspeaker generating compression waves whilst its rear propagates 
rarefactions. Acoustic reflections from surrounding walls give rise to undesirable 
interference effects which are avoided by enclosing the speaker in a cabinet. Bass reflex 
or phase inverter cabinets incorporate a vent on the same side as the speaker face at an 
acoustic distance of half a wavelength from the rear of the speaker. The vent thus acts as a 
second source in phase with the speaker face and radiation is improved. 




Interference from Linear Array of N Equal Sources 



363 




dipole 

axis 



Figure 12.19 Intensity I versus direction 6 for interference pattern between waves from two equal 
sources, n rad out of phase (dipole) with separation / <C A. The dipole axis is aLong the direction 

6 = ±tt/2 



(Problems 12.3, 12.4, 12.5) 

Interference from Linear Array of N Equal Sources 

Figure 12.20 shows a linear array of N equal sources with constant separation /generating 
signals which are all in phase (8 o = 0). At a distant point P in a direction 9 from the 
sources the phase difference between the signals from two successive sources is given by 

8 — ^-f sin 0 
A 

and the resultant at P is found by superposing the equal contribution from each source with 
the constant phase difference 8 between successive contributions. 

But we found from Figure 1.11 that the resultant of such a superposition was given by 



sin (NS/ 2) 
sin (8/2) 
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Figure 12.20 Linear array of N equal, sources separation / radiating in a direction 9 to a distant 
point P. The resulting amplitude at P (see Figure 1.11) is given by 

R = o[sin N(8/ 2) /sin (8/ 2)] 
where a is the amplitude from each source and 
8 = (27r/A)/sin 9 

is the common phase difference between successive sources 



where a is the signal amplitude at each source, so the intensity may be written 

^2 2 sin 2 (A^<5/2) sin 2 (Nitf sin 6/ X) 

sin 2 (6/2) s sin 2 (nfsin9/X) 
r &in 2 N/3 
~ /s sin 2 [3 

where I s is the intensity from each source and 3 = nf sin 0/X. 

If we take the case of N = 2, then 

7 j sin 2 2/3 AJ 2 n a, 2 $ 

I = I s ~ — = 4/ s cos 2 3 — 4/ s cos 2 - 

sin 2 (3 2 

which gives us the Young’s Slit Interference pattern. 

We can follow the intensity pattern for N sources by considering the behaviour of the 
term sin 2 /V/3/sin 2 /3. 
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We see that when 



i.e. when 



7T 



(3 = — sin#=0±7r± 2tt, etc. 
A 



/sin# = 0, ±A, ±2A...±«A 
constructive interference of the order n takes place, and 

sin 2 N/3 N 2 (3 2 



sin 



'■(3 (3 2 



■ N 



giving 

/ = N 2 1 s 

that is, a very strong intensity at the Principal Maximum condition of 

/ sin 6 = nX 

We can display the behaviour of the sin 2 N(3/sin 2 3 term as follows 

Numerator sin 2 N/3 is zero for N (3 —> Oi r . . . Nir . . . 2Nir 

Denominator sin 2 f3 is zero for (3 — > 0 ... 7r ... 2tt 



The coincidence of zeros for both numerator and denominator determine the Principal 
Maxima with the factor N 2 in the intensity, i.e. whenever / sin 0 = nX. 

Between these principal maxima are N — 1 points of zero intensity which occur 
whenever the numerator sin 2 N/3 = 0 but where sin 2 /? remains finite. 

These occur when 



/ sin 0 



X 

N' 




1) 



A 

N 



The N — 2 subsidiary maxima which occur between the principal maxima have much 
lower intensities because none of them contains the factor N 2 . Figure 12.21 shows the 
intensity curves for N = 2, 4, 8 and N — > oo. 

Two scales are given on the horizontal axis. One shows how the maxima occur at the 
order of interference n = f sind / X. The other, using units of sin f) as the ordinate displays 
two features. It shows that the separation between the principal maxima in units of sin 6 is 
A /f and that the width of half the base of the principal maxima in these units is X/Nf (the 
same value as the width of the base of subsidiary maxima). As N increases not only does 
the principal intensity increase as N 2 but the width of the principal maximum becomes 
very small. 

As N becomes very large, the interference pattern becomes highly directional, very 
sharply defined peaks of high intensity occurring whenever sin 0 changes by A//. 
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Figure 12.21 Intensity of interference patterns from Linear arrays of N equal sources of separation 
/. The horizontal axis in units of/ sin 8/ A gives the spectral order n of interference. The axis in units 
of sin 8 shows that the separation between principal maxima is given by sin 8 = A// and the half- 
width of the principal maximum is given by sin 8 = X/Nf 



The directional properties of such a linear array are widely used in both transmitting and 
receiving aerials and the polar plot for N = 4 (Figure 12.22) displays these features. For N 
large, such an array, used as a receiver, forms the basis of a radio telescope where the 
receivers (sources) are set at a constant (but adjustable) separation /and tuned to receive a 
fixed wavelength. Each receiver takes the form of a parabolic reflector, the axes of which 
are kept parallel as the reflectors are oriented in different directions. The angular separation 
between the directions of incidence for which the received signal is a maximum is given by 
sin# = A /f. 

(Problems 12.6, 12.7) 

Diffraction 

Diffraction is classified as Fraunhofer or Fresnel. In Fraunhofer diffraction the pattern is 
formed at such a distance from the diffracting system that the waves generating the pattern 
may be considered as plane. A Fresnel diffraction pattern is formed so close to the 
diffracting system that the waves generating the pattern still retain their curved 
characteristics. 
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Figure 12.22 PoLar plot of the intensity of the interference pattern from a Linear array of four 
sources with common separation /= A/2. Note that the half-width of the principal maximum is 
8 = 7t/6 satisfying the reLation sin 0 = A/A// and that the separation between principal maxima 
satisfies the relation that the change in sin# = A// 



Fraunhofer Diffraction 

The single narrow slit. Earlier in this chapter it was stated that the difference between 
interference and diffraction is merely one of scale and not of physical behaviour. 

Suppose we contract the scale of the N equal sources separation /of Figure 12.20 until 
the separation between the first and the last source, originally Nf, becomes equal to a 
distance d where d is now assumed to be the width of a narrow slit on which falls a 
monochromatic wavefront of wavelength A where d ~ A. Each of the large number N equal 
sources may now be considered as the origin of secondary wavelets generated across the 
plane of the slit on the basis of Huygens’ Principle to form a system of waves diffracted in 
all directions. 

When these diffracted waves are focused on a screen as shown in Figure 12.23 the 
intensity distribution of the diffracted waves may be found in terms of the aperture of the 
slit, the wavelength A and the angle of diffraction 9. In Figure 12.23 a plane light wave falls 
normally on the slit aperture of width d and the waves diffracted at an angle 6 are brought 
to focus at a point P on the screen PPo- The point P is sufficiently distant from the slit for all 
wavefronts reaching it to be plane and we limit our discussion to Fraunhofer Diffraction. 

Finding the amplitude of the light at P is the simple problem of superposing all the small 
contributions from the N equals sources in the plane of the slit, taking into account the 
phase differences which arise from the variation in path length from P to these different 
sources. We have already solved this problem several times. In Chapter 10 we took it as an 
example of the Fourier transform method but here we reapply the result already used in this 
chapter on p. 364, namely that the intensity at P is given by 

sin 2 N8 „ tt 

I — 7 S 0 — where N/3 = —Nf sin 8 

sin" j3 A 

is half the phase difference between the contributions from the first and last sources. But 
now Nf = d the slit width, and if we replace (3 by a where a = {tt/ A) d sin 9 is now half 
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pattern 



Figure 12.23 A monochromatic wave normally incident on a narrow slit of width d is diffracted 
through an angle 9 and the light in this direction is focused at a point P. The amplitude at P is the 
superposition of all the secondary waves in the plane of the slit with their appropriate phases. The 
extreme phase difference from contributing waves at opposite edges of the slit is <j> = 

2/t d sin 9 / A = 2 a 



the phase difference between the contributions from the opposite edges of the slit, the 
intensity of the diffracted light at P is given by 



sin 2 (7r/A)<7sin# sin 2 a 

sin 2 (7r/AiV)<7sin# s sin 2 (a/N) 



For N large 



a 



sin" m — 



N 



a 



N. 



and we have 



, sin a 

I — N 2 I S — = / o 



sin 2 a 



a- a~ 

(recall that in the Fourier Transform derivation on p. 289, 

d 2 h 2 



7 o = 



47T 2 



where h was the amplitude from each source). 

Plotting 7 = 7o(sin 2 a/a 2 ) with a = (■ it/X)d sin# in Figure 12.24 we see that its pattern 
is symmetrical about the value 

a = # = 0 

where I — I o because sin a/ a — 1 as a — > 0. The intensity 7 = 0 whenever sin a = 0 that 
is, whenever a is a multiple of 7r or 
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Figure 12.24 Diffraction pattern from a single narrow slit of width d has an intensity I = 
I o sin 2 a/a 2 where a = 7rd sin 9 / A 



giving 

d sin 6 = ±A ± 2A ± 3A, etc. 

This condition for diffraction minima is the same as that for interference maxima 
between two slits of separation d , and this is important when we consider the problem of 
light transmission through more than one slit. 

The intensity distribution maxima occur whenever the factor sin 2 a /a 2 has a maximum; 
that is, when 



d 


( sin a^ 


2 d 


( sin a^ 


da 


l a j 


da 


1 a j 



or 

cos a sin a 

9 ^ 

a a z 

This occurs whenever a — tana, and Figure 12.25 shows that the roots of this equation 
are closely approximated by a = ±37r/2, ±57 t/2, etc. (see problem at end of chapter on 
exact values). 

Table 12.1 shows the relative intensities of the subsidiary maxima with respect to the 
principal maximum Iq. 

The rapid decrease in intensity as we move from the centre of the pattern explains why 
only the first two or three subsidiary maxima are normally visible. 

Scale of the Intensity Distribution 

The width of the principal maximum is governed by the condition d sin 6 — ±A. A constant 
wavelength A means that a decrease in the slit width d will increase the value of sin 9 and 
will widen the principal maximum and the separation between subsidiary maxima. The 
narrower the slit the wider the diffraction pattern; that is, in terms of a Fourier transform the 
narrower the pulse in x-space the greater the region in k- or wave number space required to 
represent it. 
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tan a 




Figure 12.25 Position of principal and subsidiary maxima of single slit diffraction pattern is given 
by the intersections of y = a and y = tan a 



Table 12.1 



a 


sin 2 a 


Io sin 2 a 


a 2 


a 2 
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Io 


37T 


4 


Io 


T 
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57T 


4 


Io 


T 


25-7T 2 


61.7 


77 r 


4 


Io 


T 


49-7T 2 


121 



(Problems 12.8, 12.9) 



Intensity Distribution for Interference with Diffraction 
from N Identical Slits 



The extension of the analysis from the example of one slit to that of N equal slits of width d 
and common spacing/. Figure 12.26, is very simple. 
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Figure 12.26 Intensity distribution for diffraction by N equal slits is 

sin 2 a sin 2 N/3 

I ^ ® 2 • 2 n 

a. sin p 

the product of the diffraction intensity for one slit, I 0 sin 2 a/a 2 and the interference intensity 
between N sources sin 2 N/3/ sin 2 (3, where a = (n/X)d sin 9 and (3 = (n/X) f sin 6 



To obtain the expression for the intensity at a point P of diffracted light from a single 
slit we considered the contributions from the multiple equal sources across the plane of the 
slit. 

We obtained the result 



by contracting the original linear array of N sources of spacing/on p. 364. If we expand the 
system again to recover the linear array, where each source is now a slit giving us the 
diffraction contribution 



h 



sin 2 
/ o 



we need only insert this value at I s in the original expression for the interference intensity, 
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on p. 364 where 



P = jfsinO 



to obtain, for the intensity at P in Figure 12.26, the value 



sin 2 a sin 1 N [3 
or sin 2 /? 



where 



a = — d sin 6 
A 



Note that this expression combines the diffraction term sin 2 a/ a 2 for each slit (source) and 
the interference term sin 2 N/3/ sin 2 /? from N sources (which confirms what we expected 
from the opening paragraphs on interference). The diffraction pattern for any number of 
slits will always have an envelope 

sin 2 a . 

— (single slit diffraction) 



modifying the intensity of the multiple slit (source) interference pattern 



sin 2 TV/? 
sin 2 [3 



Fraunhofer Diffraction for Two Equal Slits ( N = 2) 

When N = 2 the factor 



sin 2 N/3 
sin 2 (3 



4 cos 2 /? 



so that the intensity 



I = 41 o 



sin 2 a 



cr 



cos 2 /? 



the factor 4 arising from TV 2 whilst the cos 2 /? term is familiar from the double source 
interference discussion. The intensity distribution for AT = 2,/ = 2d, is shown in Figure 12.27. 
The intensity is zero at the diffraction minima when d sin 9 = nX. It is also zero at the 
interference minima when/ sin 6 = (it + 5 ) A. 

At some value of 0 an interference maximum occurs for/ sin 8 = nX at the same position 
as a diffraction minimum occurs for d sin 8 — mX. 
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n 



d sine 



Figure 12.27 Diffraction pattern for two equal, slits, showing interference fringes modified by the 
envelope of a single slit diffraction pattern. Whenever diffraction minima coincide with interference 
maxima a fringe is suppressed to give a 'missing order' of interference 



In this case the diffraction minimum suppresses the interference maximum and the order 
n of interference is called a missing order. 

The value of n depends upon the ratio of the slit spacing to the slit width for 



i.e. 



Thus, if 



n A /sin# 
mX d sin 9 

f_ = f_ = P_ 

m d a 




2 



the missing orders will be n = 2, 4, 6, 8, etc. for m = 1, 2, 3, 4, etc. 
The ratio 

f_ = P 

d a 



governs the scale of the diffraction pattern since this determines the number of interference 
fringes between diffraction minima and the scale of the diffraction envelope is governed by a. 



(Problem 12.10) 

Transmission Diffraction Grating ( N Large) 

A large number N of equivalent slits forms a transmission diffraction grating where the 
common separation / between successive slits is called the grating space. 
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' diffraction envelope 



n = spectral order 




The intensity of each 
spectral line contains 
the factor N 2 



* 



N - 2 subsidiary maxima 



Figure 12.28 Spectral line of a given wavelength produced by a diffraction grating loses 
intensity with increasing order n as it is modified by the single slit diffraction envelope. At 
the principal maxima each spectral line has an intensity factor A/ 2 where N is the number of lines in 
the grating 



Again, in the expression for the intensity 

sin 2 a sin 2 N/3 

1 = Io 2 • o 0 

a 2 sin~p 

the pattern lies under the single slit diffraction term (Figure 12.28). 

sin 2 a 
a 2 

The principal interference maxima occur at 

/ sin# = nX 

having the factor N 2 in their intensity and these are observed as spectral lines of order n. 
We see, however, that the intensities of the spectral lines of a given wavelength decrease 
with increasing spectral order because of the modifying sin 2 a/a 2 envelope. 



Resolving Power of Diffraction Grating 

The importance of the diffraction grating as an optical instrument lies in its ability to 
resolve the spectral lines of two wavelengths which are too close to be separated by the 
naked eye. If these two wavelengths are A and A + d\ where d A/A is very small the 
Resolving Power for any optical instrument is given by the ratio \/d\. 
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Two such lines are just resolved, according to Rayleigh's Criterion, when the maximum 
of one falls upon the first minimum of the other. If the lines are closer than this their 
separate intensities cannot be distinguished. 

If we recall that the spectral lines are the principal maxima of the interference pattern 
from many slits we may display Rayleigh’s Criterion in Figure 12.29 where the nth order 
spectral lines of the two wavelengths are plotted on an axis measured in units of sin 9. We 
have already seen in Figure 12.21 that the half width of the spectral lines (principal 
maxima) measured in such units is given by X/Nf where N is now the number of 
grating lines (slits) and/is the grating space. In Figure 12.29 the nth order of wavelength A 
occurs when 



/ sin 9 = nX 



nth order spectral 
line for A 

fsine = nX 




A(sin0) = XINf 



Figure 12.29 Rayleigh's criterion states that the two wavelengths A and A + dX are just resolved 
in the nth spectral order when the maximum of one line falLs upon the first minimum of the other as 
shown. This separation, in units of sin 6, is given by X/Nf where N is the number of diffraction lines 
in the grating and/is the grating space. This leads to the resuLt that the resolving power of the 
grating X/dX = nN 
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whilst the nth order for A + dX satisfies the condition 

/[sin 9 + A(sin 6)\ = n( A + dA) 

so that 

/A (sin 0) = n dA 

Rayleigh’s Criterion requires that the fractional change 

A( si n0)=^ 

so that 

/A(sin 9) = n dA = — 

v ' N 



Hence the Resolving Power of the diffraction grating in the nth order is given by 



A 

dA 



= Nn 



Note that the Resolving Power increases with the number of grating lines N and the 
spectral order n. A limitation is placed on the useful range of n by the decrease of intensity 
with increasing n due to the modifying diffraction envelope 



sin 2 a 



a- 



(Fig. 12.28) 



Resolving Power in Terms of the Bandwidth Theorem 

A spectral line in the nth order is formed when / sin 9 = n\ where / sin 9 is the path 
difference between light coming from two successive slits in the grating. The extreme path 
difference between light coming from opposite ends of the grating of N lines is therefore 
given by 



Nf sin 9 = NnX 



and the time difference between signals travelling these extreme paths is 



A t = 



NnX 

c 



where c is the velocity of light. 

The light frequency v = cjX has a resolvable differential change 



|Az/| = c 



JAA| 

A 2 



c 

NnX 



because AA/A = l/Nn (from the inverse of the Resolving Power). 
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Hence 



NnX At 

or Av At — 1 (the Bandwidth Theorem). 

Thus, the frequency difference which can be resolved is the inverse of the time difference 
between signals following the extreme paths 

(A v At — 1 is equivalent of course to A lo At = 2i r) 

If we now write the extreme path difference as 

NnX = Ax 

we have, from the inverse of the Resolving Power, that 

AA _ 1 
~X~Nn 



so 

|AA| m = A* = _L = _L 

A 2 \A/ 2-7T NnX Ax 



where the wave number lc = 2n/X. 
Hence we also have 



Ax A k = 2ir 



where A k is a measure of the resolvable wavelength difference expressed in terms of the 
difference Ax between the extreme paths. 

On pp. 70 and 7 1 we discussed the quality factor Q of an oscillatory system. Note that 
the resolving power may be considered as the Q of an instrument such as the diffraction 
grating or a Fabry-Perot cavity for 



A v 
AX Av 



LU 

A UJ 



Q 



(Problems 12.11, 12.12, 12.13, 12.14) 

Fraunhofer Diffraction from a Rectangular Aperture 

The value of the Fourier transform method of Chapter 10 becomes apparent when we 
consider plane wave diffraction from an aperture which is finite in two dimensions. 

Although Chapter 10 carried through the transform analysis for the case of only one 
variable it is equally applicable to functions of more than one variable. 

In two dimensions, the function /(x) becomes the function fix. y), giving a transform 
F(k x . ky) where the subscripts give the directions with which the wave numbers are 
associated. 
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Figure 12.30 Plane waves of monochromatic light incident normalLy on a rectangular aperture are 
diffracted in a direction k. All light in this direction is brought to focus at P in the image plane. The 
amplitude at P is the superposition of contributions from all the typical points, x, y in the aperture 
plane with their appropriate phase relationships 



In Figure 12.30 a plane wavefront is diffracted as it passes through the rectangular 
aperture of dimensions d in the x-direction and b in the y-direction. The vector k, which is 
normal to the diffracted wavefront, has direction cosines / and m with respect to the x- and 
y-axes respectively. This wavefront is brought to a focus at point P, and the amplitude at P 
is the superposition of the contributions from all points (x,y) in the aperture with their 
appropriate phases. 

A typical point (x, y) in the aperture may be denoted by the vector r; the difference in 
phase between the contribution from this point and the central point O of the aperture is, of 
course, (2n/X) (path difference). But the path difference is merely the projection of the 
vector r upon the vector k, and the phase difference is k r = ( 2n/X)(lx + my), where 
lx + my is the projection of r on k. 

If we write 

27 d 271772 

— = k x and = k v 

X X 

we have the Fourier transform in two dimensions 



F(k x ,k y ) 



1 

(27T) 1 




*oo 

f{x,y) e -i (***+*jo0 dxdy 

— OO 



where f(x,y) is the amplitude of the small contributions from the points in the aperture. 
Taking /(x, y) equal to a constant a, we have F(k x ,k y ) the amplitude in k-space at P 



(27 t)‘ 

a 



+d/2 



-d/2 



+b/2 



-b/2 



sin a sin/3 

~r^bd — 

47T‘ a (3 



~ ikxX e ~'k y y dx cfy 



e 
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where 



7T Id k x d 

a= x = ir 



nmb k y b 

^ = ^r = x 

Physically the integration with respect to y evaluates the contribution of a strip of the 
aperture along the y direction, and integrating with respect to x then adds the contributions 
of all these strips with their appropriate phase relationships. 

The intensity distribution of the rectangular aperture is given by 



/ = / o 



sin 2 a sin 2 (3 



and relative intensities of the subsidiary maxima depend upon the product of the two 
diffraction terms sin 2 a/a 2 and sin 1 (3/ (3 2 . 

These relative values will therefore be numerically equal to the product of any two terms 
of the series 

4 4 4 

97T 2 ’ 257T 2 ’ 497T 2 ’ 

The diffraction pattern from such an aperture together with a plan showing the relative 
intensities is given in Figure 12.31. 



Fraunhofer Diffraction from a Circular Aperture 

Diffraction through a circular aperture presents another two-dimensional problem to which 
the Fourier transform technique may be applied. 

As in the case of the rectangular aperture, the diffracted plane wave propagates in a 
direction k with direction cosines 1 and m with respect to the x- and y-axes (Figure 12.32a). 
The circular aperture has a radius r 0 and any point in it is specified by polar coordinates 
(r, 6) where x = r cos 0 and y = r sin 0. This plane wavefront in direction k is focused at a 
point P in the plane of the diffraction pattern and the amplitude at P is the superposition of 
the contributions from all points (r, 9) in the aperture with their appropriate phase 
relationships. The phase difference between the contribution from a point defined (x,y) and 
that from the central point of the aperture is 



27t 2tt 

—— (path difference) (lx ■ my) = k x x + k y y 
A A 

as with the rectangular aperture, so that the Fourier transform becomes 



( 12 . 6 ) 



1 DO OG 

F(k x k y ) - —2 /(*, y) e ' i(M+M dx dy (12.7) 

( 2tT ) — oo —oo 




380 



Interference and Diffraction 




c b a I a b c 

Wide diffraction pattern from / dimension 



Figure 12.31 The distribution of intensity in the diffraction pattern from a rectangular aperture is 
seen as the product of two single-slit diffraction patterns, a wide diffraction pattern from the narrow 
dimension of the slit and a narrow diffraction pattern from the wide dimension of the slit. This 
'rotates' the diffraction pattern through 90° with respect to the aperture 



If we use polar coordinates, f(x, y) becomes f(r. 9) and dr cly becomes rdrdO, where the 
limits of 9 are from 0 to 2i r. Moreover, because of the circular symmetry we may simplify 
the problem. The amplitude or intensity distribution along any radius of the diffraction 
pattern is sufficient to define the whole of the pattern, and we may choose this single radial 
direction conveniently by restricting k to lie wholly in the xz plane (Figure 12.32b) so that 
m = k y — 0 and the phase difference is simply 



— lx — k x x — k x r cos 9 
A 



Assuming that f(r, 9) is a constant amplitude a at all points in the circular aperture, the 
transform becomes 



F(k x ) 




-ik x rcos,0 r 



( 12 . 8 ) 



This can be integrated by parts with respect to r and then term by term in a power series 
for cos 9, but the result is well known and conveniently expressed in terms of a Bessel 
function as 



F(k x ) — -j-—Ji(k x ro) 



where Ji(k x ro ) is called a Bessel function of the first order. 
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Plane of Plane of 




Figure 12.32 (a) A plane monochromatic wave diffracted in a direction k from a circuLar aperture is 

focused at a point P in the image plane. Contributions from all points x, y in the aperture superpose 
at P with appropriate phase relationships, (b) The direction k of (a) is chosen to lie wholly in thexz- 
pLane to simplify the analysis. No generality is lost because of circular symmetry. The variation of the 
amplitude of diffracted Light along any one radius determines the complete pattern 



Bessel functions are series expansions which are analogous to sine and cosine functions. 
Where sines and cosines are those functions which satisfy rectangular boundary conditions 
defined in Cartesian coordinates, Bessel functions satisfy circular or cylindrical boundary 
conditions requiring polar coordinates. 

Standing waves on a circular membrane, e.g. a drum, would require definition in terms of 
Bessel functions. 

The Bessel function of order n is written 

, . _ x n f x 2 x 4 

~ 2 "rc! \ ~ 2 ■ 2n + 2 + 2 ■ 4 ■ 2/z + 2 ■ In + 4 

so that 

3 5 7 

XX X X 

1 W = 2 “ 224 + 2 2 4 2 6 “ 2 2 4 2 6 2 8 

The expression a 2 r^ [Ji(k x ro)/k x ro] 2 , which measures the intensity along any radius of the 
diffraction pattern due to a circular aperture is normalized and plotted in Figure 12.33. 
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Figure 12.33 Intensity of the diffraction pattern from a circular aperture of radius r 0 versus t', the 
radius of the pattern. The intensity is proportional to [Ji(k x r 0 )/k x r 0 ] 2 , where J 1 is Bessel's function 
of order 1. The pattern consists of a central circular principal maximum surrounded by a series of 
concentric rings of minima and subsidiary maxima of rapidly diminishing intensity 



J | (k x r {] ) has an infinite number of zeros, and the diffraction pattern is formed by an 
infinite number of light and dark concentric rings. The first dark band will occur at the first 
zero of J\(k x rf) which is given by k x r {] = 1.2197T. 

However, 

/ 2n r 2n ■ a> 

k x r 0 = — lr 0 = — r 0 sin 0_ 



where 0' 7 is the angle between the vector k and the z-axis and defines the angle of 
diffraction. The first minimum therefore occurs at ro sin O' = 0.61 A and the next minimum 
at r o sin#' = 1.1 6 A. 

If the aperture were square with a side length 2ro (the diameter of the circle) the first dark 
fringe would be at ro sin O' = 0.5A and the second at r 0 sin O' = A. 

As the radius of the circular aperture is reduced the value of #' for the first minimum 
is increased and the whole pattern expands. This reminds us that a reduction of the pulse in 
x-space requires an increase in wave number or k-space to represent it. 

We may write equation (12.8) as 



F(kx) 




*27 T 

e — ir„-cos o rdrd9 

. 0 



where e livrcos e d6 = 2TrJo(k x r) and Jq is the Bessel function of order zero. 
Then 



F(k x ) = a 



Jo(k x r)rdr 
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Now J\ (k x r) and Jo(k x r)rdr are related by 



•k v r 0 

Jo(k x r)k x rd(k x r) = k x r 0 Ji(k x r 0 ) 
. o 



giving 



F(k x ) 



anrQ 



2J\ ( k x r 0 ) 
k x ro 



where ro is the radius of the aperture. 
The Intensity 



/ = /o 



Ji(k x ro) 

k x r 0 



with the curve shown in Figure 12.33. 

Fraunhofer Far Field Diffraction 

If we remove the focusing lens in Figure 12.32 and leave the aperture open or place the 
lens within it we have the conditions for far held diffraction. Figure 12.34, where R' 0 the 
distance from O to P' is distances in the aperture and image planes from the optic axis. 
The aperture is uniformly illuminated by a distant monochromatic source and a small area 
ds = dxdy in the aperture is <C A 2 , where A is the wavelength. 




Figure 12.34 In Fraunhofer far field diffraction the distance from the aperture to the image point 
P' is 3> distances in the aperture and image planes from the optic axis. The electric field at P' is the 
integral of the spherical waves from small areas ds in the aperture plane and the resulting intensity 
pattern is that of Figure 12.33. It is known as the Airy disc 
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The electric field at P' due to the spherical wave from ds is 

dEpr =^e iut - kR 'ds 
F R' 

Where Ee IUJt is the field at ds 
Now 

R * = f + ^-~x)2 + (y'-y) 2 

and 

R '2 = f- +x r2 +y ,2 



which combine to give 

R' = <[i + (* 2 + f)/R'o - 2(x'x + /50/<] 1/2 



and Rtf » (x 2 + y 2 ) 
so we write 



If = R , 0 [l^2(x'x + y , y)/R , tf] 1/2 



and if we neglect higher terms 



R r = R r 0 [\-(x'x + y l y)/Rtf] 

_ _ x'x _ y'y 

0 p/ p/ 

We use this value for R 1 in the expression for dE„i to give the total field at P' as 



Ep' = 



Ee 



iCJt—kR'r, 



R'r, 



ikfEdil 

e R « ds 



aperture 



Comparison with equation (12.6) shows that kx/R ' 0 — kl and ky/R' () = km of that 
equation and proceeding via polar co-ordinates we obtain the same value for the intensity 
of the diffraction pattern, 
i.e. 



,/i(&rosin6f 2 )\ 7 
krosin 0' 2 ) 



in Figure 12.33 



This far field diffraction pattern is known as the Airy disc, Figure 12.35, and its size places 
a limit on the resolving power of a telescope. When the two components of a double star 
with an angular separation Af are viewed through a telescope with an objective lens of 
focal length / and diameter d their images will appear as two Airy discs separated by the 
angle Af. The two diffraction patterns will be resolved if Af is much wider than the 
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Figure 12.35 Photograph of an Airy disc showing the central bright disc, the first dark ring and the 
first subsidiary maximum. Compare this with Figure 12.33 



angluar width of a disc but not if it is much less. Lord Rayleigh’s criterion (Figure 12.29) 
gives the critical angle A<f for resolution as that when the maximum of one disc falls on the 
first minimum of the other A, Figure 12.36. Figure 12.33 then gives 



Acj> 



0.61A 

ro 



1.22A 

d 



(Af = ©' in Figure 12.33) 



where A is the rediated wavelength. 




Figure 12.36 Two stars with angular separation A</> form separate Airy disc images when viewed 
through a telescope. RayLeigh's criterion (Figure 12.29) states that the these images are resolved 
when the central maximum of one falls upon the first minimum of the other 
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This condition is known as diffraction-limited resolution. A poor quality lens will 
introduce aberrations and will not meet this criterion. 



The Michelson Stellar Interferometer 

In the discussion on Spatial Coherence (p. 360) we saw that the relative displacement of the 
interference fringes from separate sources 1 and 2 led to a partial loss of the visibility of the 
fringes defined as 

y Anax A rim 
Anax T An in 

and eventually when the displacement was equal to half a fringe width V = 0 and there was 
a complete loss of contrast. 

Michelson’s Stellar Interferomenter (1920) used this to measure the angular separation 
between the two components of a double star or, alternatively, the angular width of a 
single star. 

Initially, we take the simplest case to illustrate the principle and then discuss the 
practical problems which arise. We assume in the first instance that light from the stars is 
monochromatic with a wavelenght Ao- Michelson used four mirros Mi Mi M 3 M 4 mounted 
on a girder with two slits .S’ 1 and S 2 in front of the lens of an astronomical telescope. Fig- 
ure 12.37. The slits were perpendicular to the line joining the two stars. The separation h of 
the outer pair of mirrors (^meters) was increased until the fringes observed in the focal 
plane of the objective just disappeared. Assuming zero path difference between Mi M 2 Po 
and M 4 M 3 Po the light from star A will form its zero order fringe maximum at Po and its 
first order fringe maximum at Pi , due to a path difference S 2 N = d sin ft — Ao so the fringe 
spacing is determined by d , the separation between the inner mirrors Mi and M 3 . 

The condition for fringe disappearance is that rays from star B will form a first order 
maximum fringe midway between Po and Pi, that is, when 

CM 1 M 2 S 1 P 0 - M 4 M 3 S 2 P 0 = CMi = h sin 0 = A 0 /2 

The condition for fringe disappearnce is therefore determined by h while the angular size 
of the fringes depends on d so there is an effective magnification of h/d over a fringe 
system produced by the slits alone. 

The angles 6 and 0 are small and the minimum value of h is found which produces 
V = 0 so that the fringes disappear at 

hf = Ao/2 or h = — 

2 0 

Measurement of h thus determines the double-star angular separation. 

Several assumptions have been made in this simple case presentation. First, that the 
intensities of the light radiated by the stars are equal and that they are coherent soruces. In 
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Figure 12.37 In the Michelson stellar interferometer Light from stars A and B strike the movable 
outer mirrors Mi and M 4 to be reflected via fixed mirrors Af 2 and M 3 through two slits 5 3 and S 2 and a 
Lens to form interference fringes. Light from Star A forms its zero order fringe at P 0 and its first order 
fringe at Pi when S 2 /V = c/sin 6 = Ao. The minimum separation h of M 1 M 4 is found for light from B to 
reduce the fringe visibility to zero, that is, when the path difference h = sin f = Ao/2. The angles 
are so small that 9 and f replace their sines. Note that the fringe separation depends on d, but the 
fringe visibility is governed by h 



fact, even if the sources are incoherent their radiation is essentially coherent at the 
interferometer. Second, the radiation is not monochromatic and only a few fringes around 
the zero order were visible so Ao must be taken as a mean wavelength. Finally, the 
introduction of a lens into the system inevitably creates Airy discs and the visibility must 
be expressed in terms of the Airy disc intensity distribution. This results in 



V — 2 



J i(m) 

u 



where 



u = Trh(j >/ \q 
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If this visibility is plotted against h<f>/ Ao its first zero occurs at 1.22 so the fringes disappear 
when h = 1.22Ao /f. 

In fact, Michelson first used his interferometer in 1920 to measure the angular diameter 
of the star Betelgeuse the colour of which is orange. His astronomical telescope was the 
2.54 m (100 in.) telescope of the Mt. Wilson Observatory. A mean wavelength 
Ao — 570 x 10 9 m was used and the fringes vanished when h = 3.07 m to give an angular 
diameter (j> — 22.6 x 10 8 radians or 0.047 arc seconds. The distance of Betelgeuse from 
the Earth was known and its diameter was calculated to be about 384 x 10 6 km, roughly 
280 times that of the Sun. This magnitude is greater than that of the orbital diameter of 
Mars around the Sun. 

The Convolution Array Theorem 

This is a very useful application of the Convolution Theorem p. 297 5th edn, when one of 
the members is the sum of a series of 5 functions. 

e.g. 



g(x) = f\ (x) 0 6(x - x m ) 

m 

f\ (x') E s ( x -x! - x m )dx' 

50 m 

= E h ( x “ x ™) 



This is a linear addition of functions each of the form f\ (x) but shifted to new origins at 
x m (m = 1, 2, 3 . . .), Figure 12.38. 

The convolution theorem gives the Fourier Transform of g(x) as 



F[g(x)]=F[f 1 (x)]F 



E s ( x - x '») 

m 



i.e. 

F(k x ) = 

m 



so the transform of the spatially shifted local function is just the product of the transform of 
the local function and a phase factor. 

This is the Array Theorem which we now apply in a more rigorous approach to the effect 
of diffraction on the interference fringes in Young’s slit experiment (p. 358) where the 
illuminating source is equidistant from both slits. 

The Array Theorem may be applied to any combination of identical apertures but 
Young’s experiment involves only the two rectangular (slits) pulses in Figure 12.39a. Here, 
/i(x) is a rectangular pulse of width d and the x m values above are x m = ± a/2. 
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Figure 12.38 In the convolution array theorem a function f\{x) is convolved with a series of Dirac 
functions which shift it to new origins 



Thus, we have the transform amplitude 

F(k x ) = F l {k x )Y j e- ikxX ' 

m 



where k x = k ■ x = kx sin 6 and k in Figure 13.39b is the vector direction from x— —a/2to 
a point P on the diffraction-interference pattern, p. 288 gives 



where 



F\{k x ) oc 



sin a 

a 



a = — d sin 9 
A 



The second term, a phase factor, is 



J2e- ik ' x " - [ e ‘W 2 + g-*W2] = 2 cos k x a/ 2 
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(b) -a/2 0 +a/2 

Figure 12.39 Young's double slit experiment represented in convolution array theorem (a) by two 
reactanguLar pulses and (b) with a path difference in the direction k of dsm<f> where a is the 
separation between the pulse centres 
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This method can be extended to produce the pattern for a diffraction grating of N 
identical slits. 



The Optical Transfer Function 

The modern method of testing an optical system, e.g. a lens, is to consider the object 
as a series of Fourier frequency components and to find the response of the system to 
these frequencies. A test chart with a sinusoidal distribution of intensity would make a 
suitable object for this purpose. The function of the lens or optical system is considered to 
be that of a linear operator which transforms a sinusoidal input into an undistorted 
sinusoidal output. 

The linear operator is defined in terms of the Optical Transfer Function (OTF) 
which may be real or complex. The real part, the Modulation Transfer Function (MTF), 
measures the effect of the lens on the amplitude of the sinusoidal input; the complex 
element is the Phase Transfer Function (PTF), a shift in phase when aberrations are present. 
If there are no aberrations and the effect on the image is limited to diffraction the PTF is 
zero. 

Changing the amplitude of the object frequency components affects the contrast between 
different parts of the image compared with the corresponding parts of the object. We shall 
evaluate this effect at the end of the analysis. 

We shall assume that the object is space invariant and incoherent. Space invariance 
means that the only effect of moving a point source over the object is to change the location 
of the image. When an object is incoherent its intensity or irradiance varies from point to 
point and all contributions to the final image are added under the integral sign. 

Over a small area dxdy of the object the radiated flux will be l()(x.y)dxdy and this makes 
its contribution to the image intensity. In addition, every point source on the object creates 
a circular diffraction pattern (Airy disc) around the corresponding image point so the 
resulting intensity of the image at ( x',y ') will be 

d /'(*',/) = h{x, y)0(x, y, xV )dx dy 

where 0(x,y,x'y') is the radially symmetric intensity distribution of the diffraction pattern 
(Airy disc). In this context it is called the Point Spread Function (PSF). 

Adding all contributions gives the image intensity 



A*',/) 




»oo 

I 0 {x, y)0(x, y, x'y') dr dy 

J — OO 



If, as we shall assume for simplicity, the magnification is unity, there is a one-to-one 
correspondence between the point (x, y) on the object and the centre of its diffraction 
pattern in the image plane. Using (x. y) as the coordinate of this centre the value of 
0(x,y,x' ,y') at any other point (x 1 ,y r ) in the diffraction pattern is given by 



0(x - x, / - y) 
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Thus, the intensity or irradiance at any image point may be written 



/V,/) = 



Io(x, y)Q(x! - x,y' - y)dxdv 



This is merely the two-dimensional form of the convolution we met on p. 293 and we 
reduce it to one dimension by writing 



/'(*') = 



7o(x) 0(x' — x)dx = Io(x' — x) 0(x)dx 



because the convolution theorem of p. 297 allows us to exchange the variables of the 
functions under the convolution integral. 

This is evidently of the form 



/' = 7 0 ® O 



with Fourier Transforms 



F(I') = F(I 0 ) ■ F(O) 



The choice of one dimension which adds clarity to the following analysis tranforms the 
PSF to a Line Spread Function (LSF) by cutting a narrow slice from the three-dimensional 
PSF. This is achieved by using a line source represented by a Dirac 8 function, the sifting 
property of which isolates an infinitesimally narrow section of the PSF. 

The shape of the three-dimensional PSF may be imagined by rotating Figure 12.33 about 
its vertical axis for a complete revolution. The profile of a slice along the diameter through 
the centre of the PSF is then the intensity of Figure 12.33 together with its reflection about 
the vertical axis. Any other slice, not through the centre, will have a similar profile but will 
differ in some details, e.g. its minimum values will not be zero, Figure 12.40. 

Thus, in one dimension, replacing O(x) by L(x) the LSF, we have 



Z'(x') 



*oo 

Io(x' — x) L(x)dx 

J — oo 



or 



with 



/' = / 0 <g> L = L ® I 0 
F(I') = F(/ 0 ) ■ F(L) = F(L) ■ F(/ 0 ) 



Let us write the intensity distribution of an object frequency component in one dimension 
as a + bcosk x x, where b modulates the cosine and a is a positive d.c. bias greater than b so 
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Figure 12.40 The profile of the Line Spread Function L(x) is formed by cutting an off-centre slice 
from the three-dimensional Point Spread Function: L(x) is the area under the curve. Note that the 
minimum values of L(x) are non-zero, unlike the curve of Figure 12.33 



that the intensity is always positive. Then, in the convolution above 

Iq = a + bcosk x (x — x ) 



and the image intensity at x! is 

fiOC 

l'(x ) =| [a + bcosk x (x ' — x)\L(x) dr 

XD 

L(x)[a + bcosk x (x — x) dr 

X) 

We remove the x! terms from the integral by expanding the cosine term to give 



I\f) = a 



L(x) dx + b cos k x x 



L(x) cos k x xdx + b sin k x x 



L(x) sin k x x dr 

) 

(12.9) 



The integrals in the second and third terms on right-hand side of this equation are, 
repectively, the cosine and sine Fourier transforms from pp. 285, 286. 

If we write 



C(k x ) = 



L(x)cosk x xdx 



and 



S(k x ) 



*oo 

L(x) sinA: A xdr 

J — CX) 
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we have 

•OO 

C(k x ) - i S(k x ) = L(x)e~ iklcX dx = F{L X ) = 

— OO 

where 

M(kx) — [C(k x ) 2 + S(k x ) 2 ] 1 / 2 

is the MTF and e~'^ kx) is the PTF with 

tan<^ = S(k x )/C(k x ) 

The OTF is, therefore, the Fourier transform of the LSF. 

If the LSF is symmetrical, as in the case of the diffraction pattern, the odd terms in S(k x ) 
are zero, so the phase change f = 0 and the OTF is real. 

For a given frequency component n we can normalize L(x) to give 

L(jc) 

jZo L "( x ) ix 

so that equation (12.9) becomes 

f (x) — a + M(k x )b(cosk x x'cos(j) — sink x x' sinf) 

= a + M(k x )b(cosk x x 1 + <j>) 

In the absence of aberrations, that is, in the symmetric diffraction limited case, f = 0. Iq is 
shown in Figure 12.41(a) and I’ (x r ) in Figure 12.41(b) where f f 0 due to aberrations. 



I 0 M 



(a) 




Figure 12.41 (a) The object frequency component a + b cos k x x is modified by the Optical Transfer 

Function 
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Figure 12.41 (b) In the image component a + M(k)bcos (k x x' + (f>), M(k) is the Modulation 

Transfer Function, which is < 1 and the phase change 0 results from aberrations. The contrast in the 
image is less than that in the object. Note that in (b) 0 is negative in the expression cos (k x x' + 0 ) 



The effect of the MTF on the amplitude of the frequency components is to reduce the 
contrast between parts of the image compared with corresponding parts of the object. 

We have already met an expression for the contrast which we called Visibility on p. 360. 
Thus, we can write 



Contrast = 



/max /min 
/mix T /min 



(a + b) — (a — b) b 

(a + b) + (a — b) a 



for the object 



The image contrast M(k x )b/a < b/a so the image contrast is less than that of the object. 



Fresnel Diffraction 

The Straight Edge and Slit 

Our discussion of Fraunhofer diffraction considered a plane wave normally incident upon a 
slit in a plane screen so that waves at each point in the plane of the slit were in phase. Each 
point in the plane became the source of a new wavefront and the superposition of these 
wavefronts generated a diffraction pattern. At a sufficient distance from the slit the 
superposed wavefronts were plane and this defined the condition for Fraunhofer diffraction. 
Its pattern followed from summing the contributions from these waves together with their 
relative phases and on p. 21 we saw that these formed an arc of constant length. When the 
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contributions were all in phase the arc was a straight line but as the relative phases 
increased the arc curved to form closed circles of decreasing radii. The length of the chord 
joining the ends of the arc measured the resulting amplitude of the superposition and the 
square of that length measured the light intensity within the pattern. 

Nearer the slit where the superposed wavefronts are not yet plane but retain their curved 
character the diffraction pattern is that of Fresnel. There is no sharp division between 
Fresnel and Fraunhofer diffraction, the pattern changes continuously from Fresnel to 
Fraunhofer as the distance from the slit increases. 

The Fresnel pattern is determined by a procedure exactly similar to that in Fraunhofer 
diffraction, an arc of constant length is obtained but now it convolutes around the arms of a 
pair of joined spirals. Figure 12.42, and not around closed circles. 

An understanding of Fresnel diffraction is most easily gained by first considering, not the 
slit, but a straight edge formed by covering the lower half of the incident plane wavefront 
with an infinite plane screen. The undisturbed upper half of the wavefront will contribute 
one half of the total spiral pattern, that part in the first quadrant. 



y = Jsm^-KU 2 <Au 




Figure 12.42 Cornu spiral associated with FresneL diffraction. The spiral in the first quadrant 
represents the contribution from the upper half of an infinite plane wavefront above an infinite 
straight edge. The third quadrant spiraL results from the downward withdrawal of the straight edge. 
The width of the wavefront contributing to the diffraction pattern is correlated with the length u 
along the spiral. The upper half of the wavefront above the straight edge contributes an intensity 
(OZi) 2 which is the square of the Length of the chord from the origin to the spiral eye. This intensity 
is 0.25 of the intensity (Z 1 Z' 1 ) z due to the whoLe wavefront 
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Undisturbed 

intensity 



Figure 12.43 Fresnel diffraction pattern from a straight edge. Light is found within the geometric 
shadow and fringes of varying intensity form the observed pattern. The intensity at the geometric 
shadow is 0.25 of that due to the undisturbed wavefront 



The Fresnel diffraction pattern from a straight edge, Figure 12.43, has several significant 
features. In the first place light is found beyond the geometric shadow; this confirms its 
wave nature and requires a Fluygens wavelet to contribute to points not directly ahead of it 
(see the discussion on p. 305). Also, near the edge there are fringes of intensity greater and 
less than that of the normal undisturbed intensity (taken here as unity). On this scale the 
intensity at the geometric shadow is exactly 0.25. 

To explain the origin of this pattern we consider the point O at the straight edge of Fig- 
ure 12.44 and the point P directly ahead of O. The line OP defines the geometric shadow. 
Below O the screen cuts off the wavefront. The phase difference between the contributions 
to the disturbance at P from O and from a point H, height h above O is given by 

, . 27 r . , 2it l h 2 

A ( /,) = t (HP-OP)= t - t 

where OP = l and higher powers of h 2 /l 2 are neglected. 

We now divide the wavefront above O into strips which are parallel to the infinite 
straight edge and we call these strips ‘half period zones’. This name derives from the fact 
that the width of each strip is chosen so that the contributions to the disturbance at P from 
the lower and upper edges of a given strip differ in phase by tt radians. 

Since the phase A (h) oc h 2 we shall not expect these strips or half period zones to be of 
equal width and Figure 12.45 shows how the width of each strip decreases as h increases. 
The total contribution from a strip will depend upon its area; that is, upon its width. The 
amplitude and phase of the contribution at P from a narrow strip of width dh at a height h 
above O may be written as (d/ 7 ) e lA where A = irh 2 /\l. 

This contribution may be resolved into two perpendicular components 



dx = d h cos A 
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-.P 



Semi-infinite HP - OP. = h 2 /l 

screen 



Figure 12.44 Line OP normal to the straight edge defines the geometric shadow. The wavefront at 
height h above 0 makes a contribution to the disturbance at P which has a phase lag of irh 2 /\t with 
respect to that from 0. The total disturbance at P is the vector sum (amplitude and phase) of all 
contributions from the wavefront section above 0 




k 2 k 3u 4 k A (h) in half period units 



Figure 12.45 Variation of the width of each half period zone with height h above the straight edge 
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and 



d y — d h sin A 

If we now plot the vector sum of these contributions the total disturbance at P from that 
section of the wavefront measured from O to a height h will have the component values 
x — j dr and y = J dv. These integrals are usually expressed in terms of the dimensionless 
variable u = h(2/\I) 1//2 , the physical significance of which we shall see shortly. 

We then have A = mi 2 / 2 and Ah = (A//2) 1 Au and the integrals become 



and 



x = 



Ax — 



cos ( 7 rn 2 / 2 ) Au 



y = dv = 



sin ( me / 2 ) Au 



These integrals are called Fresnel’s Integrals and the locus of the coordinates x and y 
with variation of u (that is, of h) is the spiral in the first quadrant of Figure 12.42. The 
complete figure is known as Cornu's spiral. 

As h, the width of the contributing wavefront above the straight edge, increases, we 
measure the increasing length u from 0 along the curve of the spiral in the first quadrant 
unit, as h and u ->oowe reach Z\ the centre of the spiral eye with coordinate x — \ , y — 
The tangent to the spiral curve is 



dv mi 1 
— = tan — 



and this is zero when the phase 

A (h) = nh 2 /XI = 7 rn 2 /2 = nm 

where m is an integer so that u = ff (2m) relates u, the distance measured along the spiral to 
m the number of half period zones contributing to the disturbance at P. The total intensity at 
P due to all the half period zones above the straight edge is given by the square of the 
length of the ‘chord’ OZ i . This is the intensity at the geometric shadow. 

Suppose now that we keep P fixed as we slowly withdraw the screen vertically 
downwards from O. This begins to reveal contributions to P from the lower part of the 
wavefront; that is, the part which contributes to the Cornu spiral in the third quadrant. The 
length u now includes not only the whole of the upper spiral arm but an increasing part of 
the lower spiral until, when u has extended to Z 2 the ‘chord’ Z i Z 2 has its maximum value 
and this corresponds to the fringe of maximum intensity nearest the straight edge. Further 
withdrawal of the screen lengthens u to the position Z 3 which corresponds to the first 
minimum of the fringe pattern and the convolutions of an increasing length u around the 
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spiral eye will produce further intensity oscillations of decreasing magnitude until, with the 
final removal of the screen, u is now the total length of the spiral and the square of the 
‘chord’ length Z | Zj gives the undisturbed intensity of unit value. But Z 1 Z , 1 = 2Z\0 so 
that the undisturbed intensity fLfL'f) 1 is a factor of four greater than (ZiO ) 2 the intensity 
at the geometric shadow. 

The Fresnel diffraction pattern from a slit may now be seen as that due to a fixed height h 
of the wavefront equal to that of the slit width. This defines a fixed length u of the spiral 
between the end points of which the ‘chord’ is drawn and its length measured and squared 
to give the intensity. At a given distance from the slit the intensity at a point P in the 
diffraction pattern will correlate with the precise location of the fixed length u along the 
spiral. At the centre of the pattern P is symmetric with respect to the upper and lower edges 
of the slit and the fixed length u is centred about O (Figure 12.46). As P moves across the 
pattern towards the geometric shadow the length u moves around the convolutions of the 
spiral. In the geometric shadow this length is located entirely within the first or third 
quadrant of the spiral and the magnitude of the ‘chord’ between its ends is less than 
OZ]. When the slit is wide enough to produce the central minimum of the diffraction 
pattern in Figure 12.47 the length u is centred at O with its ends at Z 3 and Z 4 in 
Figure 12.46. 




Figure 12.46 The slit width h defines a fixed Length u of the spiral. The intensity at a point P in the 
diffraction pattern is correlated with the precise location of u on the spiral. When P is at the centre 
of the pattern u is centred on 0 and moves along the spiral as P moves towards the geometric 
shadow. Within the geometric shadow the chord joining the ends of u is less than OZj 
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Figure 12.47 FresneL diffraction pattern from a slit which is wide enough for the spiral length u to 
be centred at 0 and to end on points Z 3 and Z 4 of Figure 12.46. This produces the intensity minimum 
at the centre of the pattern 



Circular Aperture (Fresnel Diffraction) 

In this case the half period zones become annuli of decreasing width. If r„ is the mean 
radius of the half period zone whose phase lag is rnr with respect to the contribution from 
the central ring the path difference in Figure 12.48 is given by 

NP - OP = A = n A/2 = \r;Jl 

Unlike the rectangular example of the straight edge where the area of the half period 
zone was proportional to its width d h each zone here has the same area equal to nXl. 

Each zone thus contributes equally to the disturbance at P except for a factor arising 
from the rigorous Kirchhoff theory which, until now, we have been able to ignore. This 
is the so-called obliquity factor cos x where x is shown in the figure. This factor is 
negligible for small values of n but its effect is to reduce a zone contribution as n 
increases. A large circular aperture with many zones produces, in the limit, an undisturbed 
normal intensity on the axis and from Figure 12.49 where we show the magnitude 
and phase from successive half zones we see that the sum of these vectors which represents 
the amplitude produced by an undisturbed wave is only half of that from the innermost 
zone. 

It is evident that if alternate zones transmit no light then the contributions from the 
remaining zones would all be in phase and combine to produce a high intensity at P similar 
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Figure 12.48 Fresnel diffraction from a circular aperture. The mean radius r n defines the half period 
zone with a phase lag of nir at P with respect to the contribution from the central zone. The obliquity 
angle x which reduces the zone contribution at P increases with n 



to the focusing effect of a lens. Such circular ‘zone plates’ are made by blacking out the 
appropriate areas of a glass slide, Figure 12.50. A further refinement increases the intensity 
still more. If the alternate zone areas are not blacked out but become areas where the 
optical thickness of the glass is reduced, via etching, by A/2 the light transmitted through 
these zones is advanced in phase by 7r rad so that the contributions from all the zones are 
now in phase. 




Figure 12.49 The vector contributions from successive zones in the circular aperture. The 
amplitude produced by an undisturbed wave is seen to be only half of that from the central zone. 
Removing the contributions from alternate zones leaves the remainder in phase and produces a very 
high intensity. This is the principle of the zone pLate of Figure 12.50 
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Figure 12.50 Zone plate produced by removing alternate half zones from a circular aperture to 
leave the remaining contributions in phase 



Holography 

Why is it that when we observe an object we see it in three dimensions but when we 
photograph it we obtain only a flat two dimensional distribution of light intensity? The 
answer is that the photograph has lost the information contained in the phase of the 
incident light. Holographic processes retain this information and a hologram reconstructs a 
three-dimensional image. 

The principle of holography was proposed by Gabor in 1948 but its full development 
needed the intense beams of laser light. A hologram requires two coherent beams and the 
holographic plate records their interference pattern. In practice both beams derive from the 
same source, one serves as a direct reference beam the other is the wavefront scattered from 
the object. 

Figure 12.51 shows one possible arrangement where a partly silvered beam splitter 
passes the direct reference beam and reflects light on to the object which scatters it on to 
the photographic plate. Mirrors or deviating prisms are also used to split the incident beam. 

In Figure 12.51 let the reference beam amplitude be A o e lwt . If the holographic plate lies 
in the yz plane both the amplitude and phase of scattered light which strikes a given point 
(y, z) on the plate will depend on these co-ordinates. We simplify the analysis by 
considering only the y co-ordinate shown in the plane of the paper and we represent the 
scattered light in amplitude and phase as a function of y, namely 

A(y) 



It is this information we shall wish to recover. 
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Figure 12.51 The hologram records the interference between two parts of the same laser beam. The 
original beam is divided by the partially silvered beam splitter to form a direct reference beam and a 
wavefront scattered from the object. The amplitude and phase information contained in the 
scattered wavefront must be preserved and recovered 



We may now write the resulting amplitude at y (after removing the common e‘“' 
factor) as 

A=A 0 +A(y)e i * w 



The intensity is therefore 

/ = AA* = [A 0 +A(y)e^M][A 0 + A(y) e -*M] 
- A 2 0 +A(y) 2 +A 0 A(y)[e i *W + e^J 



The holographic plate records this intensity as shown in Figure 12.52 where the 
reference intensity Aq is modulated by the terms which contain A (y) and the original 
scattered amplitude and phase information. This modulation shows of course as contrasting 
interference fringes whose local intensity is governed by the amplitude A(y) and whose 
distribution along the v axis is determined by the phase f>(y). The wavefront scattered by 
the object is now reconstructed to form the holographic image. This is done by shining the 
reference beam through the processed hologram which acts as a diffraction grating. The 
greater the recorded intensity the lower the transmitted amplitude. If the developed 
photographic emulsion possessed idealized characteristics the relation between 
the transmitted amplitude of the reference beam and the exposure would be linear. 
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Figure 12.52 Total intensity recorded as a function of y by the holographic plate. The direct 
reference beam intensity A jj is modulated by information from the scattered wavefront. This shows as 
variations in the intensity of an interference fringe pattern 



Exposure defines the product of incident intensity and exposure time. The curve relating 
the characteristics for a real holographic emulsion is shown in Figure 12.53 and this is 
linear only over a limited range near the centre indicated by the dotted lines. This imposes 
several conditions on practical holography. 

In the first place the exposure must be correctly chosen at the value Eq. Secondly, the 
value of the reference beam intensity A , 2 > must be chosen to produce the correct transmitted 
amplitude To on the vertical axis of Figure 12.53. This value of To is at the centre of the 
linear range. Finally, the modulation of A 5 by the scattered intensity A(y) 2 in Figure 12.53 
must be small enough for the transmission of the modulated signal to remain within the 
linear range of the characteristic curve. Excursions outside this range introduce non-linear 
distortions by generating extra Fourier frequency components (the situation is similar to 
that for characteristic curves in electronic amplifiers). 

Experimentally this final restriction requires A(y) <C /l 0 . 

Shining the reference beam through the processed hologram produces a transmitted 
amplitude 



AoT = Al + A§A(y) e 1 ^ + A*A(y) e“^W 
-A,iA 0 • A (y) e io '- y ® +A(y)e^W J 



where we have neglected the A(y ) 2 term as <CAq and have written the negative and 
positive exponential terms separately. This has a profound physical significance for we see 
that apart from the common constant factor A q, the observed transmitted beam has three 
components Aq, A(y) e^W and A(y)e-^W. 
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Amplitude transmittance 




Figure 12.53 Characteristic curve of a real holographic emulsion (transmittance versus exposure). 
Only the central linear section of the curve is used. The transmittance T 0 (governed by the reference 
beam intensity A §) is chosen with the critical exposure Eq to produce the central point on the linear 
part of the curve. Information from the scattered wavefront must keep the modulations within the 
linear range for faithful reproduction free from distortion 



Plane of 




Figure 12.54 (a) Shining the reference beam through the processed hologram produces three 
components A 0 ,A(y) e and A(y) e - "^ in the directions shown. Movement of the eye from X to Y 
about the component A(y) e"^ resolves the separate points 0 and O' on the image of the object to 
reveal its three dimensional nature, (b) This image at 0 is seen to be virtual while the image 
associated with the component A(y) e ~ is reaL. This real image is 'phase reversed' and the object 
appears 'inside out' 
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The first term, Ao, shows that the incident reference beam has continued beyond the 
hologram to form the central beam of Figure 12.54a. The second component A(y) e^M has 
the same form in amplitude and phase as the original wavefront scattered from the object. 
As shown in Figure 12.54b it is seen to be a wavefront diverging from a virtual image of 
the object having the same size and three dimensional distribution as the object itself. 
Moving the eye across this beam in 12.54a exposes a different section OO' of the virtual 
image to produce a three dimensional effect. 

The third component of the transmitted beam is identical with the second except for the 
phase reversal; it has a negative exponential index. It forms another image, in this case a 
real image often referred to as 'pseudoscopic’. It is an image of the original object turned 
inside out. All contours are reversed, bumps become dents and the closest point on the 
original object when viewed directly by the observer now becomes the most distant. 



Problem 12.1 

Suppose that Newton’s Rings are formed by the system of Figure 12.4 except that the piano convex 
lens now rests centrally in a concave surface of radius of curvature R i and not on an optical flat. 
Show that the radius r n of the nth dark ring is given by 

r 2 n =R 1 R 2 n\/{Ri-R 2 ) 

where Ri is the radius of curvature of the lens and R \ > R 2 (note that R \ and R i have the 
same sign). 

Problem 12.2 

Light of wavelength A in a medium of refractive index n i is normally incident on a thin film of 
refractive index n 2 and optical thickness A/4 which coats a plane substrate of refractive index n 2 . 
Show that the film is a perfect anti-reflector (r = 0) if n\ = n\ti 2 . 

Problem 12.3 

Two identical radio masts transmit at a frequency of 1500 kc s -1 and are 400 m apart. Show that the 
intensity of the interference pattern between these radiators is given by I = 2/o[l + cos (4zr sin 6?)], 
where 7o is the radiated intensity of each. Plot this intensity distribution on a polar diagram in which 
the masts lie on the 90° -270° axis to show that there are two major cones of radiation in opposite 
directions along this axis and 6 minor cones at 0°, 30°, 150°, 180°, 210° and 330°. 

Problem 12.4 

(a) Two equal sources radiate a wavelength A and are separated a distance A/2. There is a phase 
difference Sq = 7r between the signals at source. If the intensity of each source is 7 S , show that the 
intensity of the radiation pattern is given by 

7 = 47 s sin 2 sin$) 

where the sources lie on the axis ±ir/2. 

Plot 7 versus 8. 

(b) If the sources in (a) are now A/4 apart and So = 7t/2 show that 

7 = 47 s |^cos 2 — (1 + sin#)J 



Plot 7 versus 8. 
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Problem 12.5 

(a) A large number of identical radiators is arranged in rows and columns to form a lattice of which 
the unit cell is a square of side d. Show that all the radiation from the lattice in the direction 9 will be 
in phase at a large distance if tan# = m/n, where m and n are integers. 

(b) If the lattice of section (a) consists of atoms in a crystal where the rows are parallel to the crystal 
face, show that radiation of wavelength A incident on the crystal face at a grazing angle of 9 is 
scattered to give interference maxima when 2d sin# = n\ (Bragg reflection). 

Problem 12.6 

Show that the separation of equal sources in a linear array producing a principal maximum along the 
line of the sources (# = ±7r/2) is equal to the wavelength being radiated. Such a pattern is called 
‘end fire’. Determine the positions (values of #) of the secondary maxima for N = 4 and plot the 
angular distribution of the intensity. 

Problem 12.7 

The first multiple radio astronomical interferometer was equivalent to a linear array of N = 32 
sources (receivers) with a separation / = 7 m working at a wavelength A = 0.21 m. Show that the 
angular width of the central maximum is 6 min of arc and that the angular separation between 
successive principal maxima is 1°42'. 

Problem 12.8 

Monochromatic light is normally incident on a single slit, and the intensity of the diffracted light at 
an angle 9 is represented in magnitude and direction by a vector I, the tip of which traces a polar 
diagram. Sketch several polar diagrams to show that as the ratio of slit width to the wavelength 
gradually increases the polar diagram becomes concentrated along the direction # = 0. 

Problem 12.9 

The condition for the maxima of the intensity of light of wavelength A diffracted by a single slit of 
width d is given by a = tan a, where a = ird sin #/ A. The approximate values of a which satisfy this 
equation are a = 0, +37r/2, +57t/2, etc. Writing a = 2nr/2 — <5, 57t/2 — 8, etc. where 8 is small, 
show that the real solutions for a are a = 0, ±1.437r, ±2.4597t, ±3.4717t, etc. 

Problem 12.10 

Prove that the intensity of the secondary maximum for a grating of three slits is | of that of the 
principal maximum if only interference effects are considered. 

Problem 12.11 

A diffraction grating has N slits and a grating space/. If /3 = 7r/sin#/A, where 9 is the angle of 
diffraction, calculate the phase change d/3 required to move the diffracted light from the principal 
maximum to the first minimum to show that the half width of the spectral line produced by the 
grating is given by d#= (w/Vcot#) -1 , where n is the spectral order. (For N = 14,000, n= 1 and 
#=19°, d#« 5 s of arc.) 

Problem 12.12 

(a) Dispersion is the separation of spectral lines of different wavelengths by a diffraction grating and 
increases with the spectral order n. Show that the dispersion of the lines when projected by a lens of 
focal length F on a screen is given by 

d / _ d# _ nF 

d\~ F d\ ~Y 
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for a diffraction angle 9 and the nth order, where l is the linear spacing on the screen and / is the 
grating space. 

(b) Show that the change in linear separation per unit increase in spectral order for two wavelengths 
A = 5 x 10 -7 m and A 2 = 5.2 x 10~ 7 m in a system where F = 2m and /= 2 x 10~ 6 m is 
2 x 10 " 2 m. 

Problem 12.13 

(a) A sodium doublet consists of two wavelength Ai = 5.890 x 10 -7 m and A 2 = 5.896 x 10~ 7 m. 
Show that the minimum number of lines a grating must have to resolve this doublet in the third 
spectral order is ~ 328. 

(b) A red spectral line of wavelength A = 6.5 x 10 ~ 7 m is observed to be a close doublet. If the two 
lines are just resolved in the third spectral order by a grating of 9 x 10 4 lines show that the doublet 
separation is 2.4 x 10 _2 m. 

Problem 12.14 

Optical instruments have circular apertures, so that the Rayleigh criterion for resolution is given by 
sind = 1.22A /a, where a is the diameter of the aperture. 



A 




OA || O'A 

Two points O and O' of a specimen in the object plane of a microscope are separated by a distance 
5. The angle subtended by each at the objective aperture is 2 i and their images I and I' are just 
resolved. By considering the path difference between O'A and O'B show that the separation 
s = 1.22A/2sini. 



Summary of Important Results 



Interference: Division of Wavefront (Two Equal Sources) 
Intensity 



where 



I = 4 I s cos 2 6/2 
I s = source intensity 



and 



6 = 



2ir 

T 



(path difference) 



is phase difference 
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Interference (N Equal Sources - Separation f) 



Principal Maxima 



I = 



sin 2 N (3 
sin 2 (3 



where (3 



— / sin 6 

A 



I — N 2 I s at/sin# = «A 



Fraunhofer Diffraction (Single Slit - Width d) 

Intensity 

, , sin2a u 71- , • a 

I = /o r— where a = — d sin 0 

a- A 



Intensity Distribution from N Slits (Width d - Separation f) 



I = 



I o 



sin 2 asin 2 A7/9 
a 2 sin 2 /? 



(interference pattern modified by single slit diffraction envelope). 



Resolving Power of Transmission Grating 



where n is spectral order and N is number of grating lines: 

Expressible in terms of Bandwidth Theorem as 

AuAt = 1 

where Av is resolvable frequency difference and At is the time difference between extreme 
optical paths. 

Resolving power 



A 

AA 



v 

Av 



iO 

A co 



Q 



where Q is the quality factor of the system. 
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Wave Mechanics 



The wave mechanics of Schrodinger (1926) and the equivalent matrix formulation by 
Heisenberg (1926) are the basis of what is known as ‘modern physics’. Without exception 
they have been successful in replacing or including classical mechanics over the whole 
range of physics at atomic and molecular levels; these in turn govern the larger scale 
macroscopic properties. Very high energy phenomena in the physics of elementary 
particles still, however, present many problems. 

In this chapter we shall be concerned only with Schrodinger’s wave mechanics and in the 
way it displays the dual wave-particle nature of matter. This dual nature was first 
established for electromagnetic radiation but the parallel attempt to establish the wave 
nature of material particles is the basic history of twentieth century physics. 

Origins of Modern Quantum Theory 

In the nineteenth century interference and diffraction experiments together with classical 
electromagnetic theory had confirmed the wave nature of light beyond all doubt but in 
1901, in order to explain the experimental curves of black body radiation, Planck 
postulated that electromagnetic oscillators of frequency v had discrete energies nhv where 
n was an integer and h was a constant (p. 252). A quarter of a century was to elapse before 
this was formally derived from the new quantum mechanics. 

X-rays had been found by Roentgen in 1895, their wave-like properties were displayed 
by the diffraction experiments of von Laue in 1912, and their electromagnetic nature was 
soon proved. A much longer time was required to reconcile a wave nature with the 
negatively charged particles which J. J. Thomson found in his cathode ray experiments of 
1897. It was not until 1927 that interference effects from reflected or scattered electrons 
were obtained by Davisson and Germer whilst in 1928 G. P. Thomson (the son of J. J.) 
produced concentric ring diffraction patterns by firing electrons through a thin foil. 

In the meantime, in 1906, Einstein had used Planck's idea to explain the photoelectric 
effect where light falling on a given surface caused electrons to be ejected. Einstein 
considered the light beam as a stream of individual photons, or quanta of light, each of 
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energy hv. Collisions between these quanta and electrons in the target material gave the 
electrons sufficient energy to escape. 

In 1912 the alpha particle scattering experiments of Rutherford led to his proposal that 
the atom consisted of a small positively charged nucleus surrounded by enough negative 
electrons to leave the atom electrically neutral. This atom was the model used by Bohr and 
Sommerfeld in their ‘old quantum theory’, a mixture of classical mechanics and quantum 
postulates, attempting to explain, amongst other things, the regularity of spectroscopic 
series from radiating atoms. Electrons were required to orbit the nucleus at definite energy 
levels (like planets round the Sun), and radiation at a fixed frequency v was given out when 
an electron moved from a higher to a lower energy orbit with an energy difference 
A E = hv. These orbits were required to be stable or ‘stationary’ orbits with quantized, that 
is, allowed values of energy and angular momentum. The fact that classical 
electromagnetic theory had shown that an accelerating charge (electron in an orbit) was 
itself a source of radiation remained an unresolved difficulty. 

By 1920 Einstein had provided two of the vital tools necessary for further progress (a) 
that a quantum of radiation has energy E = hv, and (b) that a particle of momentum 
p — mv and rest mass m o has a relativistic energy E where E 2 = p 2 c 2 + (moc 2 ) 2 . 

This relation established the equivalence of matter and energy; a stationary particle 
v — 0 has an energy E = moc 2 where c is the velocity of light. 

The time was now ripe for the final steps leading to the modern quantum theory. The first 
of these was provided by Compton (1922-23) and the second by de Broglie in 1924. 

Compton fired X-rays of a known frequency at a thin foil and observed that the 
frequency v of the scattered radiation was independent of the foil material. This implied 
that the scattering was the result of collisions between X-ray quanta of energy hv and the 
electrons in the target material. In addition to scattering at the incident frequency a lower 
frequency of scattered radiation was always found which depended only on the mass of the 
scattering particles (electrons) and the angle of scattering. Compton showed that these 
results were consistent if momentum and energy were conserved in an elastic collision 
between two ‘particles’, the electron and an X-ray of energy hv, a rest mass m o = 0 and 
(from Einstein’s relativistic energy equation), a momentum 



E hv h 




where c = v X. 

De Broglie in 1924 proposed that if the dual wave-particle nature of electromagnetic 
fields required a particle momentum of p = h / A, it was possible that a wavelength A of a 
‘matter’ field could be associated with any particle of momentum p = mv to give the 
relation p = h/X. His argument was as follows. 

If the phase velocity of such a ‘matter’ wave obeys the usual relation 

n p = vX 

where v is the frequency, the assumption that any particle has a momentum p = h/X 
together with Einstein’s expression E = hv yields n p = E/p. 
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The theory of relativity gives, for a particle of rest mass m o and velocity v an energy 
E = me 2 and a momentum p = mv, where 



/ n = mo ^1 — 

is the particle mass at velocity v. For such a particle the phase velocity 



E c 2 





that is. 



vv v = c 2 



(an expression we met earlier for the wave guides of p. 243). 

This gives a phase velocity v p > c for a particle velocity v < c. However, the energy in 
the de Broglie wave (or particle) travels with the group velocity 



which, for 



dui 

dk 



E = hu = — ui 
2n 



and 



gives 



P 



h h , 
\~7^ k 



duj _dE 
dk dp 



Such a particle with relativistic energy E where 

E 2 = p 2 c 2 + (; m 0 c 2 ) 2 



2 E 



dE 

dp 



2 pc 2 



has 
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or 

dE pc 2 vc 2 

Vg = dp = lf = ^ =V 

so that the group velocity of de Broglie matter wave corresponds to the particle velocity v. 

Even the ‘old quantum theory' of Bohr-Sommerfeld gained something from de 
Broglie’s hypothesis. Their postulate that the angular momentum of stationary orbits was 
restricted to integral (quantum) numbers of the unit angular momentum h was shown, for 
the circular orbit of radius r, to yield 



2irrp = nil 



or 



2 „ = d! = „ A 

P 



so that the circumference of a stationary orbit was a standing wave system and contained an 
integral number n of A, the de Broglie wavelength. 

Within three years, however, such quantum numbers ceased to be assumptions. They 
were the natural outcome of the new quantum theory of Schrodinger and Heisenberg. 



Heisenberg's Uncertainty Principle 

Although, as we shall see, Schrodinger’s equation takes the form of a standing wave 
equation, the fitting of an integral number of de Broglie standing waves around a Bohr orbit 
presents a fundamental difficulty. The azimuthal symmetry of such a pattern. Figure 13.1, 




Figure 13.1 Integral number of de Broglie standing waves A = h/p around a circular Bohr orbit 
does not allow the exact position of the electron to be specified at a particular time 




Heisenberg ’s Uncertainty Principle 



415 



representing an electron in an orbit, does not allow the exact position of the electron to be 
specified at a particular time. This dilemma was resolved by Heisenberg on the basis of the 
Bandwidth Theorem we first met on p. 134. 

There, a group of waves with a group velocity v g and a frequency range A v superposed 
effectively only for a time At where 



Av At « 1 

Similarly, a group in the wave number range Ak superposed in space over a distance Ax 
where 



Ax Ah rs 27 r 

But the velocity of the de Broglie matter wave is essentially a group velocity with a 
momentum 



where 



P 



h h 

A = 2i k = 



Hk 




so 



A p — li Ak 

and the Bandwidth Theorem becomes Heisenberg’s Uncertainty Principle 

Ax A p « h 



Since 



it follows that 



E = liv — — lo = hu> 
2t r 



— — = A E At k h 
Av 



and 



A E k, TiAu) 



are also expressions of Heisenberg’s Uncertainty Principle. 
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Ax 




Same group 
after time t 



Figure 13.2 A wave group representing a particle showing dispersion after time t. The square of the 
wave amplitude at any point represents the probability of the particle being in that position, and the 
dispersion represents the increasing uncertainty of the particle position with time (Heisenberg's 
Uncertainty Principle) 



This relation sets a fundamental limit on the ultimate precision with which we can know 
the position x of a particle and the x component of its momentum. If Figure 13.2 shows a 
wave group representing the particle, the range Ax shows the uncertainty of the position of 
the particle in the range of space over which it could be found, with the probability of its 
being at a particular place given by the square of the wave amplitude of that position. The 
relation 



Ax A p m h 

means that the velocity of the particle ( p — mv ) is also uncertain, the more accurate the 
knowledge of the particle position, the less certain that of the value of its velocity. If the 
particle is ‘observed' at some later time, dispersion of the group will have increased the 
range Ax and decreased the amplitude. The uncertainty of the position has increased and 
the probability of its being at any one place has become less. But this is because of the 
original uncertainty of its velocity, through A p, which makes an accurate forecast of its 
position after time t even more unlikely. 

The shape of the wave group above is often taken as a Gaussian curve written 'T (x, t) 
with a width Ax at t = 0 where the value 'I'(x. t) is e 1 of its maximum value (see p. 289). 

P(xt) defines the probability density of finding the particle at a position Ax, i.e. within 
the range x and x + Ax. 

The position x and momentum p x of a particle are conjugate parameters, so the 
representation of the particle in momentum space <T> (p x , t) is the Fourier transform of 
fi^(x, t) and ( l ) (p x , t) is also a Gaussian curve with a width A p x where < h(/? A f) is e” 1 of its 
maximum value. 

If the group velocity of the wave packet is v g — po/m a rigorous treatment of the time 
development of these functions leads to the conclusion that P{xt) falls to e 1 of its 
maximum value at the points where 



x — Vgt = ±Ax 
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where 



A x(t) 



h 

Px 



1 + 



(A Px )\ 2 

m 2 H 2 



and hence increases with time. 

If the time is sufficiently small so that 

t < h 



mh 

i^Px ) 2 



the second term in the bracket is negligible and the wave packet propagates with only a 
very small change in its width. 

As an example, a Gaussian wave packet for an electron localized at time t = 0 to within 
a distance of 10~ 1() m (atomic dimensions) with A p x = fi/Ax « 10 24 kg ■ m ■ s“' will have 
spread to twice its size at time t = fi\/3 ~ 10~ 16 s. 

An example of the relation 

A E At h 

may be found in considering the time spent by an electron in an atomic orbit. In a stable 
orbit this time At is long and the energy uncertainty A E is small so the energy levels of 
stable orbits are well defined. When an electron changes energy levels and radiation is 
emitted the time in the orbit may be short and the energy levels ill defined so that the term 
A E contributes to the breadth of a spectral line. 



(Problems 13.1, 13.2, 13.3, 13.4, 13.5, 13.6, 13.7, 13.8, 13.9, 13.10) 



Schrodinger's Wave Equation 

The old quantum theory had sought to establish rules for the existence of discrete 
frequencies and energy levels. An integral number of de Broglie half wavelengths could be 
fitted around a circular Bohr orbit. Both of these facts are consistent with the classical 
standing wave systems we examined in Chapters 5 and 9 when waves travelling between 
rigid boundaries were perfectly reflected. 

In Chapter 5 we saw that the transverse displacement y(xt) of a string of length / with 
both ends fixed obeys the wave equation 

d 2 y 1 d 2 y 

dx 2 v 2 dt 2 



where v p is the wave velocity. 

The x and t dependence could be separated in the solution for standing waves to give 



, . , . LU„X . 

y(x, t) = A sin sin co„t 
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where n could take the integral values n = 1, 2, 3, etc. to give the discrete eigenfrequencies, 

nirv v 
u n = — 

The solution y(x, t ) corresponding to a given lu„ is called an eigenfunction or a wave 
function. 

In developing the Schrodinger wave equation which applies to particle behaviour we use 
arguments below which in no way constitute a proof because wave mechanics cannot be 
derived from classical mechanics. Wave mechanics is based on certain postulates the 
validity of which can be confirmed only by the accuracy of the predicted results. 

From the preceding sections we have the representation of a particle as a matter wave 
with energy E = huj, momentum p = f/k and velocity v„ = dou/dk. 

Wave mechanics uses the notation 



V(x,t) = T'o 0 e i(p * _ ® )/s 

to define the amplitude of a matter wave at a point x at time t. The physical significance of 
ip is amplified on p. 422 but for the moment we note the reversed sign of the exponential 
index which follows the convention used in all books on quantum mechanics. This merely 
introduces a 7 r rad phase difference from the notation consistently used in the earlier 
chapters of this book but the new convention will be used throughout this chapter to avoid 
confusion with other texts and attention will be carefully drawn to any possible ambiguity. 

In classical mechanics the total energy £ of a particle of mass m and momentum p in a 
conservative field of potential V is given by 

E = p 2 /2m + V 



Differentiating 'I' (x.t) gives 



d 2 —p‘ 



and inserting this value of p 2 in the classical energy equation above gives 






If we now express T (x, t ) = 'ip(x) e VjJt we may cancel the common e l:jJ ' factor from the 
equation above to obtain the time independent Schrodinger wave equation 



2m dx 2 



ip(x) + (E — V)ip{x) = 0 



This time independent wave equation will give states of constant frequency, that is, of 
constant energy , and these are the only states we shall consider in this book. 
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Note that this equation has the same form as the standing wave equation we first met on 
p. 124. 

States which are not of constant energy require the time dependence to be retained in 
Schrodinger’s equation. We do this by using the fact that 

d -i E 

s »(m)= t *(u) 

and inserting this value of E in the classical energy equation. This gives the time dependent 
Schrodinger wave equation 



One-dimensional Infinite Potential Well 

Consider as a first example the case of a particle constrained to move in a region between 
x — 0 and x = a where the potential V — 0. At x = 0 and x = a the potential walls are 
infinitely high as shown in Figure 13.3. This is an idealized form of the potential seen by an 
electron in the low energy levels near the nucleus of an atom. 



V(x) 




Figure 13.3 An infinitely deep potential well showing allowed energy levels E n for a particle 
constrained to move within it with wave function ip n = Asmk n x where k 2 n = 2mE/fi 2 and m is the 
particle mass 
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Since V{x) = 0 for 0 < x < a Schrodinger’s equation becomes 

d 2 ip{x) 2m E 



dx 2 7r 

which may be written, as on p. 124, in the form 

d 2 ip 



ip — 0 



dx 2 



+ k ip = 0 



K = 7j~ 

h- 

The boundary conditions are that ip{x) =0 at x = 0 and x = a where V (x) becomes 
infinite, whilst the other terms in the equation remain finite. The particle must lie within the 
well and classically, whatever the value of its energy E it will rebound elastically off the 
potential ‘walls’. When moving to the right the particle behaviour may be represented by a 
wave function of the form e 1 ' kx which satisfies Schrodinger’s equation, and when moving 
to the left by a wave function of the form 

e~ ikx 

But, as with the waves on the string, perfect reflection which reverses the amplitude 
allows ip n (x), the solution of Schrodinger's equation, to represent a standing wave system 
at lo„; expressed in the form 



ip n {x) = Ce^' - Ce - *"* 
= A sin k n x 



where 



A 



C 

2i 



The boundary condition ip„(x) = 0 at x = a gives k„a = mr for n — 1,2, 3, etc. i.e. 
k n = tm/a. 

Hence 



2 _ 2mE„ 
" _ h 2 



2 

n 7r 



2 



giving energy eigenvalues 



n 2 TT 2 h 2 



E, 



a 2 2m 
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Thus, we see that discrete energy values governed by the quantum number n arise naturally 
from the application of boundary conditions to the wave function solutions of 
Schrodinger’s equation. Values of the particle momentum are also quantized since 



P = 




n nit 
a 



It is evident that in an infinite potential well, an electron or particle cannot have an arbitrary 
energy but must take only the quantized values E„. This restriction will hold whenever 
Schrodinger’s equation is solved for a potential V(x) which imposes boundary conditions 
constraining the particle to move in a limited region. 

The wave functions ip„(x) for n — 1, 2, 3 are plotted in Figure 13.4 showing them to be 
identical with the allowed amplitude functions for standing waves on a vibrating string 
with fixed ends. Note that the interval between allowed energy states decreases as either the 
mass of the particle or the dimensions of the potential box increase relative to h. For 
particles of large mass and systems of large dimensions the allowed energy states form, for 
all practical purposes, a continuum and are no longer quantized. Thus, in passing from 
atomic to much larger dimensions the results of quantum mechanics approach those of 
classical physics. 

We see that the minimum value of the energy of the particle in the potential well is not 
zero but 



li 2 ir 2 
1 2 ma 2 




x= 0 x = a x= 0 x= a 



Figure 13.4 Wave functions ip n (x) and probability densities \ip„(x)\ 2 for the first three allowed 
energy levels in an infinitely deep potential weLL of width a 
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This minimum energy is related to Heisenberg’s Uncertainty Principle 

Ax A p ss h 

The uncertainty in the position of the particle is obviously Ax = a and the particle 
momentum p may be in either the positive or negative x direction giving an uncertainty 

A p = 2 p 



Thus 



Ax A p = a ■ 2p « h 



or 



P 



h 
2 a 



Now, for V(x) =0 

p 2 h 2 h : 7T 2 

E — £__ ~ ~ 

2m 8ma 2 2 ma 2 

This is an example of the so-called zero point energy. We shall meet others. 

(Problem 13.11) 

Significance of the Amplitude i// n (x) of the Wave Function 

In Figure 13.4 the amplitude ip n (x) of the wave function is plotted for the values n = 1,2,3 
together with the values 



IV’nWI 2 

In the waves we have met so far, the amplitude, or rather the amplitude squared, has been a 
measure of the intensity of the wave. At a position of high amplitude, the wave was more 
intense — more energy was localized there. Here we have expressed the motion of a particle 
confined to a small region of space in terms of its associated matter wave. The amplitude of 
the wave function ^(x) varies from point to point within the small region in which the 
particle is to be found. Outside the infinite well ip(x) is zero. The intensity of the matter 
wave is written 



IV’WI 2 = 

where the complex conjugate tp*(x) indicates that 'ip(x') may sometimes be complex. Since 
the matter field describes the motion of the particle we may say that the regions of space in 
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which the particle is more likely to be found are those in which the intensity \f(x) | 2 is 
large, or, more formally 

‘the probability of finding the particle described by the wave function tp(x) in the 
interval dr around the point x is \'tl'(x) \ 2 dr . 

The probability per unit length of finding the particle at x is 

P{x) = \fi{x)\ 2 

In three dimensions a wave function would be of the form y, z) and the probability of 
finding the particle in the unit volume element surrounding the point xyz is 

P(xyz) = \ fi(xyz)\ 2 

The probability of finding the particle within a finite volume V is obviously 



Pv = 



\fi(xyz)\ 2 drdy dz 



J v 

Now the particle must always be somewhere in space so, in extending the integral over all 
space, the probability becomes a certainty; that is, it equals unity, or 



1 4>(xyz) | 2 dx dy dz = 1 

J all space 



This process of integrating over all possible locations to give unity is called 
normalization and it always imposes restrictions on the form of ip{x, y, z) which must 
tend to zero as x, y or z tends to infinity. 

Normalization determines the value of the constant A in our wave function 



/ , x . . nTTX 
ip„(x) = A sin 



for the case of the infinite potential well. 
There 



\fi„(x)\- dx = 

= A 



IV’nWr dr 



. , nnx , , a 

sin d x — A - = 1 

a 2 



Hence 
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and the normalized wave function 



tynix) 




mrx 

a 



(Problem 13.12) 



Particle in a Three-dimensional Box 

Suppose the particle is confined to a rectangular volume abc at the bottom of an infinitely 
deep potential well (V = 0) where a, b and c are the lengths of the sides of the rectan- 
gular box. 

The energy of the particle is then 
where the momentum components are 



Px = n i 



P y = n 2 



7T Tl 

a 

7 tH 

b 

Tili 



Pz = « 3 - 



and m, n 2 and n 3 are integers. 

The energy levels allowed in the box are therefore given by 



E = 



Mr 



nr n o n ' 



2m \a 2 b 2 c 2 / 

and solutions for the space part of the wave function may be written 

, , . mmc . «27ry . n^irz 

y,z) = A sin sin sin 

abc 

in accordance with the three-dimensional normal mode solution of p. 249. 
If the box is cubical so that a = b = c the allowed energy levels become 



£ = 



7T“/l 1 , 2 2 7 T 2 h 2 



2 ma 2 



("1 r n 2 ■ //.,) - 



2 ma 2 



where k 2 = n\ 



n\ with wave functions 



n { TTX . n 2 7ry . n 3 irz 

mxyz) = A sin sin sin 

a a a 
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Table 13.1 



Energy 


n 1, n 2, «3 Combinations 


Degeneracy 


3£i 


(1, 1, 1) 


1 


6E 1 


(2, 1, 1) (1, 2, 1) (1, 1, 2) 


3 


9 E x 


(2, 2, 1) (2, 1, 2) (1, 2, 2) 


3 


llEi 


(3, 1, 1) (1, 3, 1) (1, 1, 3) 


3 


12E, 


(2, 2, 2) 


1 


14£ ! 


(1, 2, 3) (3, 2, 1) (2, 3, 1) (1, 3, 2) (2, 1, 3) (3, 1, 2) 


6 



We saw, however, on p. 250 that combinations of different n values can give the same k 
value; that is, the same energy value. When n \ , n 2 and n 2 are permuted without changing 
the k value, the wave function is also changed so that a certain energy level may be 
associated with several different wave functions or dynamical states. The energy level is 
said to be degenerate , the order of degeneracy being defined by the number of different or 
independent wave functions associated with the given energy. 

In the case of the cubic potential box, the lowest energy level is 3E 1 , i.e. 

(n 1 = n 2 = «3 = 1) 



where 

7 x 2 h 2 
1 2 ma 2 

The next energy level is given by 6E \ , with a degeneracy of 3 where the n values are 
given by (2, 1, 1) (1, 2, 1) and (1, 1, 2). Higher energy values with degeneracy orders are 
shown in Table 13.1 above. 



(Problem 13.13) 

Number of Energy States in Interval E to E + d£ 

As long as the dimensions of the cubical box above are small the energy levels remain 
distinct. However, when the volume increases, as is the case for free electrons in a metal, 
successive energy levels become so close that an almost continuous spectrum is formed. 

If we wish to find how many energy levels may be contained in the small energy range 
d E when the potential box is very large, we have only to apply the result of p. 251 where 
we found that the number of possible normal modes of oscillation per unit volume of an 
enclosure in the frequency range v to v + dv is given by 

, 47TZ/ 2 d V 
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There we stressed that the result was independent of any particular system and we applied 
it to Planck’s Radiation Law and Debye’s Theory of Specific Heats. Here we use it with 



^so that 




and p 



E 

c 



hv 

c 



dE — — dp = h dv 
m 



and 



dp 




to give the number of states per unit volume in the energy interval dE as 



d n(E) 



4tt( 2 m 3 ) 1/2 £'/ 2 ^ 
h 3 



This may be applied directly to determine how free electrons in a metal may distribute 
themselves in a band of energies from zero to some value E. Each energy level can 
accommodate two electrons (with opposing spins) according to Pauli’s Principle so the 
total number of electrons per unit volume in the energy range zero to E is 



n = 



dn(E) 



2 ■ 47t(2/?j 3 ) 

1? 



E 1 / 2 dE 



J o 



167r(2m 3 ) 1/2 3/2 

3h 3 



where m e is the electron mass. 

If the metal is in its ground state the available electrons will occupy the lowest possible 
energy levels, and if the total number of electrons per unit volume n t] is less than the total 
number of energy levels in the band, then the electrons will occupy all energy states up to a 
maximum energy E v called the Fermi Energy which is given by 

167r(2m 3 ) 1/2 E^/ 2 

” 0 = 31? 

Typical values of E\ are of the order of 5 eV (1 eV = 1.6 x 10 -19 J). 

(Problems 13.14, 13.15) 

The Potential Step 

The standing wave system of the infinite potential well where the wave function 
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Figure 13.5 Wave functions ip\(x) and ip 2 (x) for a particle mass m, energy E < V at a potential 
step V{x) = 1/ 



is finite in the region V(x) = 0 but zero at all other points is unique in the formal 
correspondence it presents between classical and quantum mechanical results. The 
quantum effects become evident when we consider the general case of the potential step of 
finite height Vin Figure 13.5 which is an idealized form of the very steep potential gradient 
of a conservative force 



F(x) 



dV 

dx 



Such a potential step would be seen by a free electron near the surface of a metal. 

It is necessary to consider separately the two cases where the total particle energy E is (a) 
less than the potential energy Vi and (b) greater than V, where 



E = 



2 m 



+ V(x) 



(a )E<V 

When E is less than V, the region x > 0 of Figure 13.5 is forbidden to the particle by 
classical mechanics for the kinetic energy 



P 



2 



2m 



would then have a negative value. 

In finding the complete solution for ip(x) for the potential step we must solve 
Schrodinger’s equation for the separate regions of Figure 13.5, x < 0 (region 1) and x > 0 
(region 2). 
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In region 1, V(x) = 0 and we have 



d 2 M*) 

dx 2 



2mE 



ipi(x) = 0 



with a solution 



ip i (x)=Ae iklX + Be~ iklX 



where 



, 0 2m E 

k '=lF 

The term A e' k,x (with the sign convention of this chapter) is the wave representation of an 
incident particle moving to the right, and Be~ lk>x represents a reflected particle moving to 
the left. 

In region 2, V(x) = V and Schrodinger's equation becomes 

d 2 M*) , ME-V), , n 



or 



where 



d 2 ip 2 (x) 

dx 2 



a 2 ip 2 {x) = 0 



a 



2 



2m (V - E ) 
h 2 



This equation has the solution 

ip 2 (x) - Ce^ + De® 

Now the probability of finding the particle in region 2 where it is classically forbidden 
depends on the square of the wave function amplitude \xp 2 (x)\ 2 with the condition that for 

any wave function to be normalized ^i.e. for 

|^ 2 (*)| 2 dx= 1^ 



the wave function ipi{x) — > 0 as x — > oo. 

This forbids the second term De 0 * which increases with x but still leaves 



ipi{x) = Ce 
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to give a finite probability of finding the particle beyond the potential step, a probability 
which decreases exponentially with distance. This is a profound departure from classical 
behaviour. 

At the boundary x = 0, ip{x) must be finite to give a finite probability of finding the 
particle there, but there is a finite discontinuity in V(x). In these circumstances 
Schrodinger’s equation asserts that the second derivative 

d 2 ip(x) 

dx 2 

at x = 0 is finite, which means that both ip{x) and (dxp(x) /dx) are continuous at x = 0. 
These are the boundary conditions which allow the separate solutions 

i>i(x) and i> 2 {x) 

for the wave function, to be matched across the boundary of the two regions. 

The continuity of ip{x) at x = 0 gives ip\{x) = ip 2 {x) or A + B = C whilst 

dipijx) _ dip 2 {x) 
dx dx 



at x = 0 gives 



i k 1 (A — B) — — ocC — — cx(A T B) 



Thus 



and 



B = 



/i^±a\ 

\ik 1 — aj 



2\k\ 
ik\ — a 



A 



The wave functions for the separate regions then become 



ipi(x) =A 




I ikj + a c ~ik, x 
ik\ — a 



and 



Wi)=^Ae-“ 

iA' 1 — a 

and these are shown in Figure 13.5. Note particularly that the intensity of the incident part 
of the wave function 



\ipi{x)\ 2 = |A| 2 
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whilst the reflected intensity is 



LB 



2 



\k\ + a 

A 

lk i — a 



|A| 2 



Thus, for any energy E < V we have total reflection as in the classical case, even for those 
particles which penetrate the classically forbidden region x > 0 where ip 2 (x) is finite. 

In region 2 the probability of finding the particle is 



P(x) = \^i(x)\ 2 = |Ce “'I 2 



2\k\ 
ik\ — a 



Ae‘ 



4k 2 



kl 



■ A 2 e~ 2ax 



cr 



Since the exponential coefficient a depends on V(x) the greater the value V(x) the faster 
the wave function ip 2 (x) goes to zero in region 2 for a given total energy E < V. 

When V(jc) — > oo, as in the case of the infinite potential well, ip 2 (x) becomes zero, as we 
have seen; and there is no penetration into the classically forbidden region. 

Several important physical phenomena may be explained on the assumption that a 
particle with E < V meeting a potential step of finite height Fand finite width b has a wave 
function ip 2 (x) which is still finite at x = b, making it possible for the particle to tunnel 
through the potential barrier (Figure 13.6). The probability that the particle will penetrate 
the barrier to x = b is given by 



P(x) = \ipi{x)\ 2 ex e 2ax 

and beyond this barrier the particle will propagate in region 3 with a wave function fifix) 
of reduced amplitude. The boundary conditions must then be applied at x = b to match 

ip 2 {x) to VaM- 




3 

V3(x) 



Figure 13.6 Narrow potential barrier of width b penetrated by a particle represented by ip fix) 
leaving a finite amplitude ip 3 (x) as a measure of the reduced probability of finding the particle in 
region 3 
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Figure 13.7 Application of an electric field E to the surface of a metal at potential l/ 0 reduces the 
potential to V = V 0 — Ex forming a barrier of finite width which may be penetrated by an energetic 
electron near the metal surface 



This quantum ‘tunnel effect’ is the basis of the explanation of the radioactive decay of 
the nucleus. In addition the potential step seen by a free electron near the surface of a metal 
may be distorted, as shown in Figure 13.7, by the application of an external electric field, to 
form a barrier of finite width. The most energetic electrons near the surface of the metal can 
leak through the barrier in a process known as field electron emission. 

Another example results from the two possible positions of the single nitrogen atom with 
respect to the three hydrogen atoms in the ammonia molecule NH 3 . These positions are 
shown as N and N' in Figure 13.8 together with the potential barrier presented to the 
nitrogen atom as it moves to and fro between N and N' . This penetration occurs at a 
frequency of 2.3786 x 10 10 Hz for the ground state of NH 3 and its high definition is used 
as an atomic clock to fix standards of time. 



H 





Figure 13.8 The two possible configurations N and N' of the nitrogen atom with respect to the 
triangular hydrogen base in the ammonia molecule NH 3 and the finite potential barrier penetrated by 
the nitrogen atom at a frequency of >10 10 Hz in the NH 3 ground state 
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(Problem 13.16) 

(b) E > V 

In the region x < 0 in Figure 13.5 V(x) 

d 2 M*) 

dx 2 



0 and Schrodinger's equation is 
2 mE 



rr 



-ipi(x) = 0 



or 



with 



C>Vl 

dx 2 



+ k 2 ip i = 0 



, 0 2mE 

l '=lv 



having a solution 



ipi(x) = Ae iklX + Be~ ikiX 



with both incident and reflected terms. 

The momentum of the particle is p \ where p] /2m = E. 

In the region x > 0, V(x) — V and Schrodinger’s equation is 

d 2 ip 2 (x) 2 m(E-V) , , . n 

-4V-+ l M*) = o 



or 



where 



d 2 ip 2 

dx 2 



+ k\l\)2 = 0 



,,2 _ ME - V ) 

2 h 2 

and the particle momentum p 2 is given by p 2 /2m — (E — V). 

In the wave function solution for this region we consider only the right-going or 
transmitted term since there is nothing beyond x — 0 to cause a reflection, so 

ip 2 {x) = Ce ,klX 



Now the wave number k is related to the de Broglie wavelength of the particle and we 
see that k changes when the potential V changes; that is, when the particle experiences a 
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change in the force acting on it. Such a particle therefore reacts to a changing potential as 
light reacts to changing refractive index. As the potential V increases for E > V the 
momentum p and wave number k(p = Tik) decrease and the wavelength A increases. 

At x = 0 the conditions for continuity give 

= tylix) 



or 



and 



or 



A+B= C 



dipijx) _ dip 2 {x) 
dx dx 



k\{A — B) = k 2 C 



These two equations give 



B = 



( k i ~ k i) A 
(k i + k 2 ) 



and 



k i + k 2 

Since B is not zero, some reflection takes place at x = 0 even though the energy E > V. 
This is clearly not classical behaviour. If many particles form an incident beam at x — 0 
and each particle has velocity 



p i lik i 

1 m m 



then the velocity of transmitted particles will be 



p 2 hk 2 

m m 



The incident flux of particles; that is, the number crossing unit area per unit time, may be 
seen as the product of the velocity and the intensity; that is 
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The reflected flux is 
and the transmitted flux is 



Vi\B 



Thus, the reflection coefficient, the ratio of reflected to incident flux is 

R _ vi\B\ 2 _ (ki - k 2 f 
v\ | A | 2 ( k { +k 2 f 



and the transmission coefficient, the ratio of transmitted to incident flux is 

T _ V 2 1 c\ 2 _ k 2 {2k i) 2 _ 4kik2 
ui|A| 2 ki (j il +k 2 ) 2 {ki+k 2 ) 2 

results which are similar to those for our classical waves in earlier chapters. 

Note that R + T = 1 showing that the number of particles is conserved. 

We have chosen here to apply R and T to a number of particles forming a beam. These 
coefficients, when applied to identical particles forming the beam, measure the average 
probability that an individual particle will be reflected or transmitted. 



(Problem 13.17) 



The Square Potential Well 

Let us consider a particle with energy E < V moving in the square potential well of width a 
in Figure 13.9. Within the well the potential is zero, and the value V of the height of the well 



V(x) = 1/ 



E< 1/ 



V(x) = 0 









V'sM 


VhM 


V'j.M 


3 

► 


i 


2 



x= 0 x= a 



Figure 13.9 A particle with energy E < 1/(1/ = the finite height of a square potential well of width 
a) may take only the energy values E satisfying the equation 

fZmE 2JE(V-E) 

t ano \ h ? =■ 2E _ V 

The wave functions in the three regions are matched at the boundaries x = 0 and x = a by the 
conditions that ip(x) and dip{x)/dx are continuous 
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is finite. This potential approximates that of a finite range force which has no influence 
beyond a limited distance. Outside the range of the force the potential may be considered 
constant. From our discussion of the infinitely deep potential well (V = oo) and of the 
potential step we can expect our wave function representation to have the form of an 
integral number of de Broglie half wavelengths within the well, plus an exponentially 
decaying penetration into the wall on either side. 

Writing Schrodinger’s equation for each of the three regions, we have for region 
1(0 < x < a) 



d 2 M*) 

dx 2 



2mE 

~¥~ 



ipi(x) = o 



with a solution, for k\ = 2mE/ Ti 2 of 



(x)=Ae iklX + Be~ ikiX 

— A(cos&i.r + i sin/cjx) + B(cos&ix — i sin&ix) 
= A i cos k\x + B\ sin^jx 



where A i = A + B and B i = i(A — B ). 
In region 2(x > a) 



d 2 i/j 2 (x) 
dx 2 



2m(E - V ) 

¥ 



tp 2 (x) = 0 



has the solution 



M*)=A 2 e ax + B 2 e~ ax 



where 



a 



2 



2m 

F 



(V-E) 



In region 3, (x < 0) 



d 2 i/j 2 (x) 

dx 2 



2 m(E - V ) 

¥ 



tp 3 (x) = 0 



has the solution 



ip 3 (x) =A 3 e ax + B 3 e~ ax 

For ip(x) to remain finite as x — > ±oo (normalization condition) A 2 and B \ must be zero, 
and the boundary conditions ip(x) and dtp(x) / dx continuous, must be satisfied at x = 0 and 



x = a. 
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At x = 0, 



ipi(x) = ip 2 (x) and 



(tyl (*) _ <9i/>3(*) 
<9x <9x 



give 


A i = A3 


(13.1) 


and 


k 1 B 1 = a A 3 


(13.2) 


whilst at x = a 


, , ^ ^ , &>pi(x) dip 2 (x) 

and gx = gx 




give 


A 1 cos£i <3 + B\ sin/cio = B 2 e~ Qa 


(13.3) 


and 


—k\A\ sin^iu + k\B\ cosk\a = —aB 2 e~ aa 


(13.4) 



In order to satisfy equations (13.1), (13.2), (13.3) and (13.4) some conditions must be 
imposed on k and a; that is, on the value of E, so only certain values of E are allowed. 
Equations (13.1) and (13.2) give 



A i _ k\ 

B\ a 

and this equation with equations (13.3) and (13.4) yields 

2k | a 

tan £ 0 0 

k\ — a- 



or 

f2mE 2JE{V - E) 
tan a\ — w- = — 

V h 2 2 E-V 

Only those values of E which satisfy this relation are allowed energy states, but these 
values must be found by numerical or graphical methods. 

The wave functions for the first three allowed energy values are shown in Figure 13.10 
and their general behaviour may be clarified by considering Schrodinger’s equation in the 
form 



il) 

j ip — — (+ve constant ) (E — V) 
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x = 0 x = a 



Figure 13.10 Wave functions for a particle in a square potential well with the lowest three allowed 
energies £ lr f 2 , £3. Note the exponential decay of ip(x) outside the box 



Now d 2 ip/dx 2 is the rate of change of the slope; that is, the curvature of the wave function 
and when E > V both sides of the equation are negative and the ip curve must everywhere 
keep its concave side towards the x axis as it always does, for example, in sine and cosine 
curves. The curvature increases with E so we shall expect more de Broglie half 
wavelengths in the higher energy levels. This is consistent with the argument that an 
increase in E increases the wave number k and reduces the de Broglie wavelength A. 

In the lowest energy level the ip curve is always without a node, the next level always has 
one node, the third two nodes, etc. but the zeros will not be quite equally spaced and the ip 
amplitude will not be uniform across the well. In particular it will increase near the 
potential walls as the particle is slowed down to give a higher probability of the particle 
being found there. Where E < V the ratio 



d 2 ip/dx 2 

tp 
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will be positive and the ip curve must keep its convex side towards the axis as in 
exponential curves. The classical boundary E = V must always mark the division where 
the character of the ip curve changes from one form to the other and the two parts of the 
curve will only match for certain values of E. 



The Harmonic Oscillator 

As a final example to illustrate the fitting of ip curves into a potential well we shall consider 
the potential curve V — jsx 2 of the harmonic oscillator in Figure 13.1 1. The calculation of 
the ip curves is too complicated for this chapter but their essential features confirm what we 
may expect from our earlier examples. Moreover, by purely classical arguments we shall 
obtain a very good approximation to the wave mechanical results. 

In 1901 Planck had postulated that the energy of such an oscillator could have the values 
E = nhv where n was an integer and v was the frequency. Schrodinger was able to derive 
this result in 1926 but one essential difference arises from the Uncertainty Principle which 
requires a minimum energy level or zero point energy of \ hv. 

For a classical oscillator the minimum energy E = 0, point 0 in Figure 13.1 1 gives the 
precise and simultaneous values x = 0 and p = 0; that is, a zero oscillation. The 
Uncertainty Principle forbids this. If is the smallest amplitude of the oscillator 
compatible with the Uncertainty Principle, then 

a o ~ 'Ax- 




Figure 13.11 Potential energy curve 1/ of a harmonic osciLlator with alLowed energy levels 
E„ = (n + j)hn . The energy E (with oscillator amplitude a) is shown in the text to define an average 
value of the de Broglie wavelength A = /j/(|mf ) 1 ' 2 
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If po is the maximum momentum of the oscillator with amplitude ao it may be either in the 
positive or negative direction so 



Po ~ 5 A p 

The energy of a classical oscillator is given by 

E = \mjj 2 a^ = \ w(ao)(mu)ao ) = ^uiaopo 
~ bujAxAp k, k, = ^hv 

All other energy levels will therefore take integral steps of Iw above this zero point energy. 
Let us consider the energy level of the oscillator which has an amplitude a so that 



1 



1 



1 



2 2 



E = \- V — f -sx = -sa — -mu> a 



2 m 



2 m 2 



so that 



oj V m 



The value of the kinetic energy of the oscillator averaged over the distance 2 a between ±a 
may be written 



j“ a p 2 /2mdx 1 
[“ dx = 2a 

J —a 



l 



E — -moj 2 x 2 ) dx = E — —mu>‘ L a z = —E 



1 



2„2 



because 



E = \mu) 2 a 2 



Thus, the average value of the kinetic energy 

P 



2 2 . 



9 =T £ 

2 m 3 



giving 



h 4mE 



P = X = 



This gives an average value for the de Broglie wavelength of 



A = 





440 



Wave Mechanics 



and we expect n half wavelengths to ht into the length 2 a at energy E where 



2 





Thus 



A n h 

n 2 ~ 2^4mE/3 



2 

UJ 




Writing uj = 2nv we have 




- nhv = 0.96 nhv 
2 



which is a fairly close approximation to nhv. The correct result, however, must take into 
account the zero point energy of \hv and the energy levels are given by 

E = (;? + \)hv, n = 0, 1, 2, 3, etc. 

The if; curves for the first four energy levels are plotted in Figure 13.12 together with those 
for \ip\ 2 . 

We see that whilst a classical oscillator may never exceed its maximum amplitude a 
particle obeying a wave mechanical description has a finite probability of being found 
beyond this limit. 
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E 2 = 

£1 = 
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Figure 13.12 Wave functions ip(x) and probability densities \ip(x)\ z for the first four energy levels 
of the harmonic oscilLator 
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(Problems 13.18, 13.19) 

Electron Waves in a Solid 

Bloch Functions and the Kronig-Penney Model 

When electrons move through a solid, e.g. a metal, they meet a series of potential barriers 
generated by the atoms or ions located at the centre of the valleys between successive 
barriers. Figure 13.13 shows such a one-dimensional lattice array of ions. The electron 
wave function is derived via Bloch functions and the electron behaviour is demonstrated 
using the Kronig-Penney Model which replaces Figure 13.13 in the first instance with a 
periodic series of potential wells of finite depth as shown in Figure 13.14. An exact but 
unwieldy solution can be found for the situation described by Figure 13.14, but Kronig and 
Penney, by deepening the wells and reducing their separation, were able to show how the 
electrons behaved and to demonstrate the restrictions imposed on their motion. 




Figure 13.13 A one-dimensonal periodic array of poterntial barriers formed by ions or atoms 
located along a crystal lattice 




Figure 13.14 A series of finite potential wells used by Kronig and Penney as a first approximation 
of Figure 13.13 
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In Figure 13.14 the space between the potential wells is a, the well thickness is b and its 
height is Vo. The problem is similar to that described on p. 435 where the total energy of 
the electron is E — Vq so the wave equation is 



d 2 i/; 
dx 2 



—t(E — Vo)ip = 0 
n 



Now, V(x) is periodic so V x = V (x + /) where l = a + b. Evidently, since the probability 
of finding an electron at x or at x + / is the same, we have 



|^(x )| 2 = \ip(x + Z )| 2 

Hence, we may write ip(x + l) — 7 ip(x) where 77 * = | 7| 2 — 1 (7* is the complex conjugate 
of 7 ). 

At this stage we could write 7 = e lkx , but this does not define k well enough to satisfy the 
boundary conditions at each end of the crystal. For periodic functions the conventional 
method to meet the boundary conditions is to form a ring of circumference of length 
L — Nl where L is the length of the crystal and N is the number of atoms along its length. 
Note that in Figure 13.13 the potential barriers at each end of the crystal add / to its length. 
Proceeding along the crystal (or around the ring) we have 



ip(x + 21) = il>{x + 1+1) — yip(x + /) = 7 2 (V’) 



or for r integral steps 

ip(x + rl) = Yi^(x) r — (0, 1, 2, 3 . . .N — 1) 

Now r — 0 and r — N are identical positions (one complete circuit of the ring), so 

ip(x + NI) = 7 iV ^(x) = ip(x) 



that is 



7*=1 



We may now write 



7 = e /2 nr/N (r = 0, 1,2,3 . . .) 



so that 



x^(x + I) = 70 ( x ) = e ,2 ’ Kr / N ip(x) 

The Bloch function /j/, (x) is defined by 

ip(x) = /jL k (x)e lhc 
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where 

Ek{x) — Hk(x + 1 ) 

Here, k = Inr/IN and fik(x) has the periodicity of the potential. Since r changes by units as 
we move along the crystal each step of r/N (for N large) is so small that k — Inr/IN may 
be considered as varying continuously. 

The Bloch functions satisfy all conditions because 

ip(x + l) = e lk ^ x+r> Hk(x + 1) = e' kl e lkx Hk{x) — e l ^^{x) = yil)(x) 

The wave equations of Figure 13.14 are 

^ V + oripi = 0 0 < x < « (13.5) 

and 

. V - /? 2 ^2 = 0 -b<x< 0 (13.6) 

where 

7 2mE , ~ 2m. , 

ct = — — and P = — (Vo - E) 

h 2 n 2 

with 

V(x) = V(x + /) and / = a + b 

The Bloch function jik (x) = f.ipx + Z) where l — a + b, so for x = — b we have 
/r.t(a) = )ik(—b), which is evident from Figure 13.14. 

Earlier examples in this chapter have shown that the boundary conditions require ip(x) 
and its first derivative to be continuous across any potential change. 

Applying t(j(x) = fit (x) e lkx to equations (13.5) and (13.6), we have 

Mi (x) = Ae i{a ~ k)x + Be~ i(a+k)x 0 < x < a 
Hi (x) = + De-V**** -b<x< 0 

so that the boundary conditions are 

wi.,, (t^ o =(t) mo 

and 

with (£)_-(£) 
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which give four homogenous equations. 
Remember that 



p(x) = n(x + /) 

As with the rectangular well on p. 435 these boundary conditions determine the permitted 
values of E (via a and (3). Here, the boundary conditions require either 
A = B = C = D = 0 or the determinant of their coefficients to be zero. Equating the 
determinant of the coefficients to zero gives the unwieldy expression 

— or 

— sin aa sinh f3b + cos aa cosh /3b = cos kia + b) (13.7) 

lap 

Kronig and Penney simplified this equation by allowing Vo to tend to infinity as b 
approached zero in such a way that V)/; remained constant. This has two important 
implications. First, the potential wells become very deep so that Figure 13.14 approximates 
Figure 13.13. Second, their separation is narrowed so that l = a + b k, a and we may 
rewrite equation! 13.7) as 



( ma\ sin aa 

Vo b \ h cosaa = cos ka (13.8) 

\ h ) oia 

~ 1 

The values of a = (2mE/H f which satisfy this equation determine the permitted energy 
values and wave functions of the electrons. 

Note that when Vo — > 00 equation (13.8) requires sin aa = 0 to remain valid, leaving 

a = ± — (n— 1,2,3...) 

or 

Tr 2 h 2 n 2 
2ma 2 



which are the quantized energies of the tightly bound electron in the infinitely deep 
potential of p. 420. 

At the other extreme when Vq = 0 equation (13.8) gives 



a = k — 




1 

2 



which allows E to take any positive value. This gives a free particle solution to the wave 
equation (graphed as the dotted parabola in Figure 13.16). 

Between these two extreme values of Vo the permitted values of the energy E are 
displayed on the graph in Figure 13.15 where the left-hand side of equation (13.8) is plotted 
against aa where aa is written w and Vq Z>(?t) is written K. 
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Figure 13.15 Allowed electron energy values are denoted by heavy horizontal lines which define 
the Brillouin zones. These occur when the left-hand side of equation (13.8) has values between ± 1. 
The curve is symmetric about the axis w = 0. 



Now the limits of cos ka in equation (13.8) are ±1 and these determine the allowed 
values of w — aa indicated by the heavy horizontal line on the w or aa axis. These in turn 
denote the permitted ranges or bands of energy values which the electron may take. The 
bands increase with w = aa and between the bands are gaps where electron energies are 
forbidden. The limits of each energy band are defined by cos ka = ±1 that is 

k = ± — (n — 1, 2, 3, . . .) 

and the regions in k space defining the energy bands are known as Brillouin zones. The 
band for n = 1 is called the first Brillouin zone, n = 2 is the second Brillouin zone and so 
on. Figure 13.15 can be displayed as the energy E versus k graph in Figure 13.16 where the 
dotted parabola defines the free electron energy E — ^-k 2 and the heavy lines at the k 
boundaries denote the permitted electron energies in a given band. The cosine curves 
joining the zone boundaries are justified by Figure 5.15, which shows that no new 
information is gained by extending the k range beyond — 7r /a <k< n/a. This limited 
range of k values defines the reduced zone scheme. 
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3n _ 2 k jt_ 0 _tt_ 27i 37i 

a a a a a a 



Figure 13.16 Figure 13.15 displayed as allowed electron energies versus k. The dotted parabola 
defines the free electron energy E = f) 2 k z /2m and the allowed energy bands are the Brillouin zones 
Bj. l/j, l/ 2 , l/ 3 are the energy gaps between the zones. The cosine curves joining the zone boundaries 
are justified by Figure 5.15, i.e. all relevant information is contained in the region < k < jj 



The number of energy states (excluding spin) in each zone is determined by 



2nr 2n r 
W~aN 



(r = 0,l,2,3...JV-l) 



for each k value represents an allowed energy state. Each value of r gives a different value 
of k\ there are N such values. Hence, in this range 



7T !T 2ir 2nr 

< k < — i.e. — = where a « / 

a a a Na 



the number of energy levels is equal to the number of atoms. 

As a h = l — > 00 each band contracts to a single level which is iV-fold degenerate 
since the electron can be bound to any one of the atoms. For finite values of l this 
degeneracy is removed and each discrete atomic level spreads into a band of N levels. 
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Figure 13.17 Elastic Bragg reflection occurs when electron waves are scattered by atoms in planes 
separated by a distance a. Principal maxima are formed when 2osin = n A 



Only free electrons will escape interaction with the ions in the crystal lattice; almost free 
electrons will experience weak coupling to the lattice. Coupling which is strong enough to 
reflect electron waves may be seen in terms of Bragg reflection. Figure 13.17. Here, waves 
reflected by successive planes in a crystal which are separated by a distance a reinforce to 
give maxima on reflection when 2 a sin 6 = nX. 

When 6 — 7r/2 and the coupling is strong enough the electron waves will be reflected 
from successive ions. Figure 13.18, giving a path difference of 2a. Reflection maxima 
occur for 



2a = ±«A = ±« 



2-7T 

T’ 



i.e. k — 



nil 
± — 



a 



Thus, Bragg reflections define the Brillouin zone boundaries. 



2 ► 

V < 

1 > 

• < a >•< a >•< a >*< a >• 

Figure 13.18 When 6 = 7t/ 2 in Figure 13.17 Bragg scattering by electron— ion interactions gives 
principal maxima when electron waves are reflected from ions separated by multiples of a. The 
condition 2 a = n A defines the Brillouin zone boundaries for n = 1,2, 3, etc. 
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Figure 13.19 The wave function v|/ (even) has an anti-node at an ion (atom) site. The anti-node for 
v|/ (odd) is Located midway between sites. This governs the energy of interaction, which is different 
for the two v|/ values 



Wave functions of electrons can be represented by travelling waves in both directions, 
i.e. by e ±,fa and for k = ± mr/a standing waves will be formed by the sum or difference of, 

e-g. 



e ,7ra/a and e~ i7Cx/a 




The energies associated with these two wave functions will differ when they interact with 
the ions. ip eve n = cosnx/a has anti-nodes (maxima) at the site of each ion so the electron- 
ion interaction is attractive and the energy corresponding to ?/> even is lowered. 
V'odd = sin 7 tx/g has its anti-node midway between ion sites where the potential is 
repulsive. Figure 13.19. The calculation of these energy shifts requires knowledge of the 
effective potential, but it can be shown that for ip eve n the energy change at a given V„ in 
Figure 13.16, where V n is the energy gap between bands, is A E = - ' V n and for (Add the 
energy change is A E = \ V n (see Problem 13.21). Note that the band widths and gaps 
increase with n. 

The band structure may also be demonstrated by considering the effect of tunnelling. 
Two widely separated equivalent potential wells may each contain a single electron 
occupying identical energy levels. When the potential well separation becomes small 
enough for the tunnelling of Figure 13.6 to be possible this symmetry is destroyed because 
the wave function of an electron spreads right across both wells and their separating 
potential barrier, Figure 13.20. There is a finite probability of finding an electron at any 
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Figure 13.20 When an electron can tunnel between two potential wells (a, b) and — (a, b) it cannot 
exist in a single energy state. The higher of the two resulting energy states has a greater curvature 



point x of its wave function ^{x) so the two electrons cannot occupy the same energy level 
and the single state splits into two. The lowest lying energy levels split into a narrow band 
of very closely spaced states since the barrier to tunnelling is very large for electrons in 
these levels. Higher energy levels have a wider spread and it is even possible for bands to 
overlap. The band structure helps to explain the difference between electrical conductors 
and insulators. 

Once an energy level is occupied by an electron it cannot accept another electron. 
However, in a metal only the lower energy levels in a band or Brillouin zone are occupied 
and an applied electric field can accelerate electrons which move to occupy higher 
available energy states within the band. Insulators have completely filled energy bands so 
the electrons cannot move under the influence of an electric field - there are no empty 
neighbouring states. 
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However, a very strong electric field can cause an electron to jump from the top of a band 
across a gap to occupy an empty level immediately above the gap, so the insulator breaks 
down. A spark can jump across an air gap between two terminals; lightning is such a spark 
on a much larger scale. A semiconductor is basically an insulator with a very narrow 
forbidden gap where even a small energy change will switch the insulator into a conductor. 



Phonons 

Pages 135 and 162 showed that the elastic field in a crystal could sustain transverse and 
longitudinal modes of vibration along a chain of atoms acting as a series of coupled 
oscillators. In a normal mode of angular frequency oj, every atom performed simple 
harmonic oscillations of t o\. On p. 440 we saw that the energy of such oscillations at atomic 
and sub-atomic levels was quantized with values of in + 1 /2)hu>. 

The concept of photons as quanta of energy hut associated with an electromagnetic held 
allows the analogy of phonons as quanta of energy associated with the elastic held. In a 
normal mode of angular frequency tu, the energy of a phonon is /zcu, so phonons can be seen 
as exciting a mode to an energy state (n + \ )hiOj . When n — 0 the mode lo 1 is left with the 
zero-point energy 3 hio,. A more detailed calculation of Debye’s theory of specihc heats 
(page 253) takes account of this quantization. 

Normal modes are plane waves extending throughout the crystal and phonons are not 
localized particles. The uncertainty principle prevents an exact determination of a phonon 
position and it exists as a localized wave packet of combined modes with a small spread of 
frequency and wavelength and a group velocity dcu/k. The number of phonons, like that of 
photons, is not conserved. They are created and absorbed by collisions and, like photons, 
they obey Bose-Einstein statistics (appendix 1). However, unlike photons, they exist only 
within the crystal. They contribute to the crystal momentum but do not carry momentum. 
This is evident from Figure 5.15 where a lattice vibration has a wave number 
k — k±- — (m — 1,2,3,...) so hk has no precise meaning. Indeed, when the mode 
oscillations are purely harmonic the equilibrium position is zero so phonon momentum is 
zero. 

Phonon-phonon collisions are usually three-phonon processes in which both transverse 
and longitudinal waves are involved. They are characterized by energy conservation 

hiCi = hu>2 + hu>2 

and by phonon wave vector conservation 

qi = q 2 + q3 

A phonon of wave vector q, can separate into two phonons with wave vectors q 2 and q 3 . 
Alternatively, q 2 can absorb q 3 to form qj. Phonon-phonon collisions play a role in the 
thermal conductivity of a crystal; neutron interactions with the crystal lattice also involve 
the concept of phonons. 

When particles, as waves, interact with crystal structures they create diffraction patterns 
when the particle wavelength is of the order of atomic separation within the crystal. 
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typically ~ 2 x 10 10 m. The waves of X-rays striking a crystal create principal maxima 
on reflection to satisfy Braggs Law (p. 447) when the path difference 

2 a sin 8 = nX 

where a is the separation between the reflecting (diffracting) planes. If k is normal to the 
particle wave fronts before striking the crystal and k' is normal to the wave front leaving 
the crystal the condition |k| = k' defines the scattering as elastic, so Bragg scattering is 
elastic. Knowing the plane separation of a nickel crystal, determined by X-rays, Davisson 
and Germer were able to find the wavelength of electrons by Bragg elastic scattering (see 
Problem 13.20). 

Neutrons with A~2x 10 10 m have been used in non-elastic scattering experiments 
where |k| |k'| to probe the structure of crystals, that is, the atomic arrangements and 

separation. Where X-rays interact chiefly with electrons surrounding the nucleus of an 
atom, uncharged neutrons interact much more strongly with its nucleus; lattice vibrations 
are set up so phonons play a role in the scattering. 

Non-elastic scattering may be seen in terms of Figure 13.21 where waves in the wave 
front normal to k are scattered by atoms 1 and 2 in a row where the atomic separation is a. 
The phase lag of the wave incident on atom 2 is a sin 0 with respect to that striking atom 
1, but after scattering it leads the wave scattered by atom 1 by a phase =kra sin <fi. A 
diffraction maximum occurs when the phase difference 

27T 27T 

— a sin# — —asincf) = kasinO — k'asincj) = I2 tt(1 = 1,2,3, . . .) 

A A 

i.e. 

a(k - k') = Z2tt 
or 



k-k' 




a 




Figure 13.21 When electrons are scattered from atoms separation a, in the same pLane, the 
scattering may be inelastic, i.e. |k| ^ | k 7 | . Here, the electron of wave number k = 2n/\ strikes atom 
1 ahead in phase of that striking atom 2 by 2-7r/A a sin 6, but after scattering it lags that from atom 2 
by a phase difference sin 4>. Note that X need not = X' 
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Note that k — k' is a vector in diffraction space and A' need not equal A. This is true for 
every row of lattice points in the x direction. 

The expression Z27r/« represents a series of planes in k space with a separation 2i r/a. 
Crystal planes in a second dimension with separation b would form another series of planes 
m = 1, 2, 3, . . .) with separation ^ in k space having lines of intersection with the 
series /. A set of crystal planes in a third dimension with separation c would form a final set 
of planes n l7T (n — 1, 2, 3, . . .) with separation 27 t/c in k space. These three sets of planes 
would meet in points (/, m, n) in k space to form the reciprocal lattice. In three dimensions 
the diffracted vector k — k' would end on a reciprocal lattice point /, m. n. There is no 
requirement for the directions a , b and c in the crystal to be mutually perpendicular, but a 
symmetry exists between the crystal lattice and its reciprocal in that planes in the one are 
perpendicular to rows of points in the other and the plane spacing in one is 27r times the 
reciprocal of the point spacing in the other. 

When neutrons are diffracted from a crystal lattice in which a phonon of wave vector q 
and frequency u> is already excited, more than one diffraction maximum can appear. This 
first maximum will result from Bragg elastic scattering, i.e. |k| = k' | . 

A second maximum occurs in a vector direction 

g = k - k ' + q 
or 

k' = k + q - g 

This suggests that a neutron of wave vector k has absorbed a phonon of wave vector q to 
become a neutron of wave vector k. In the scattering, because the neutron is initially 
outside the crystal, the crystal plus the phonon receives a momentum 

f?(k - k') = h( g - q) 

Conventionally, the momentum Tig is associated with the whole lattice while fiq (associated 
with the absorbed phonon) is known as the crystal or quasi-momentum of the phonon 
because it acts as a momentum when absorbed by the neutron. 

In pure phonon-phonon collisions two processes may occur. The three phonons involved 
may begin and end in the same Brillouin Zone, this is called a normal process. In some 
cases, however, the third phonon may finish outside the Brillouin zone. This is known as 
the Umklapp process. This occurs when a phonon is Bragg reflected (at the edge of a 
Brillouin zone) at the same time as it absorbs another phonon. We know, however, that a 
phonon of wave vector q is identical with a phonon of wave vector q ± so the third 
phonon may be considered as remaining within the Brillouin zone. Umklapp processes play 
a role in the thermal conductivity of a crystal in the following way. 

When the crystal lattice vibrations are purely harmonic the separation between adjacent 
atoms during vibrations contributes an energy term oc (x,- — x,-±i) 2 , where x, is the 
displacement of an atom from its equilibrium position. In this case a phonon may travel 
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along hundreds of atoms without hindrance. However, with increasing energy, i.e. 
temperature, vibrations become anharmonic and cubic terms replace the squared term 
above because separate normal modes become coupled. Effectively, a cubic term describes 
the emission of a phonon by another phonon or the decay of a phonon into two phonons 
and the energies of individual phonons are changed. The phonons constitute a gas where 
the phonons have approximately constant speed (unlike in a real gas), but have a larger 
number density and energy density at the hot end of the crystal. Heat flow is primarily by 
phonon flow with phonons being created at the hot end and destroyed at the cold end. The 
thermal resistance in an insulator is produced by collisions which reverse the group 
velocity of the phonons, and the Umklapp process involving high-energy phonons at Bragg 
reflection on the edge of the Brillouin zone is significant here. 

(Problems 13.20, 13.21) 



Problem 13.1 

The energy of an electron mass m charge e circling a proton at radius r is 




2 m 4n£o r 



where p is its momentum. 

Use Heisenberg’s Uncertainty Principle in the form ApAr « Vi to show that the minimum energy 
(Hj atom ground state) is 



E o = 



4 

—me 

8 £ q /z “ 



at a Bohr radius 



£ph 2 
nine 2 



Problem 13.2 

The observation of a particle annihilates its mass m and its rest mass energy is converted to radiation. 
Use the relations A p Ax ~ h and E = pc for photons to show that the short wavelength limit on 
length measurement is the Compton wavelength 




me 



Show that this is 2.42 x 10 12 m for an electron. 



Problem 13.3 

When x and p vary simple harmonically it can be shown that the averaged values of the squares of 
the uncertainties satisfy the relation 



(Ac 2 )(Ap 2 ) 
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If the energy of a simple harmonic oscillation at frequency uj is written 



E = ^- + - m u 2 x- 



show that its minimum energy is \hv. 

Problem 13.4 

An electron of momentum p and wavelength A = h/p passes through a slit of width A*. Its 
diffraction as a wave may be regarded in terms of a change of its momentum Ap in a direction 
parallel to the plane of the slit (its total momentum remaining constant). Show that the approximate 
position of the first minimum of the diffraction pattern is in accordance with Heisenberg’s 
uncertainty principle. (Note that the variation of the intensity of the principal maximum in the 
pattern is a direct measure of the probability of the electron arriving at a point on the screen.) 

Problem 13.5 

A beam of electrons with a de Broglie wavelength of 10 _5 m passes through a slit 10 m wide. 
Show that the angular spread due to diffraction is 5°47'. 

Problem 13.6 

Show that the de Broglie wavelength of an electron accelerated across a potential difference V is 
given by 



A = h/(2m e eV ) 1/2 = 1.29 x lCT 9 V“ 1/2 m 
where V is measured in volts. 

Problem 13.7 

If atoms in a crystal are separated by 3 x 10 -10 m (3 A) show that an accelerating voltage of ~ 3 kV 
would be required to produce electrons diffracted by the crystal. 

Problem 13.8 

Electromagnetic radiation consists of photons of zero rest mass. Show that the average momentum 
per unit volume associated with an electromagnetic wave of electric field amplitude Eg is given by 

P = \eoEl/c 



(Verify the dimensions of this relation.) 

Problem 13.9 

Show that the average momentum carried by an electromagnetic wave develops a radiation pressure 

P = cp = \eqEI 

when the wave is normally incident on a perfect absorber and a pressure 



P = 2 cp = s oEq 
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when the wave is normally incident on a perfect reflector. (Radiation incident from all directions 
within a solid angle of 2-7T will introduce a factor of 1 /3 in the expressions above.) 

Problem 13.10 

If the radiation energy from the sun incident upon the perfectly absorbing surface of the earth is 
1.4 W m~ 2 and the radiation comes from all directions within a solid angle of 1-k show that the 
radiation pressure is about 10 -n of the atmospheric pressure. 

Problem 13.11 

In a carbon molecule the two atoms oscillate with a frequency of 6.43 x 10 Hz. Show that the 
zero point energy is 1.34 x 10 -3 eV (leV = 1.6 x 10~ 19 J). 

Problem 13.12 

A particle of mass m moves in an infinitely deep square well potential of width 2 a defined by 

V(x) = 0 — a < x < + a 

V(x) = oo \x\ > a 

If it is described by the wave function 

1 ( Tt 2 x 2 \ 

^ X) = 7h{ l -^p) forW - fl 

= 0 |jc| > « 

show by calculating J" a \ip(x)\~ dx that the probability of finding it in the box is 0.96. 

Show that in its normalized ground state, it is represented by ip(x) = (l/^/a) cos (ttx/2o) and 
expand this in powers of ttx/ 2 a to compare it with the wave function above. 

Problem 13.13 

Show that the normalization constant for the wave function 

. , , . . n \7TX . n 2Try . tl^TTZ 

• u — ^ — /I ...« Sln _ — 

c 

of a deep potential well is equal to 

(8 /abc) 1/1 . 

Problem 13.14 

A total of N electrons occupy a volume V in a solid at a very low temperature between the energy 
levels 0 to E F the Fermi energy. 

Show that their total energy 



uz i — /X Olll Oil! 

a b 



describing an electron in a volume abc at the bottom 



U = 



E dn = 




‘E f 

J 0 




d E 



giving an average energy per electron of ^Ep. 
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Problem 13.15 

Copper has one conduction electron per atom, a density of 9 and an atomic weight of 64. Show that 
/?o, the number of free electrons per unit volume is « 8 x 10 2s m~ 3 and that the value of its Fermi 
energy level is about 7 eV (leV = 1.6 x 10“ 19 J). 

Problem 13.16 

The probability of a particle of mass m penetrating a distance x into a classically forbidden region is 
proportional to e~ 2ax where 

a 2 = 2 m(V — E)/h 2 

If x is 2 x 10 “ 10 m (2 A) and (V — E) is 1 eV (1.6 x 10 _19 J) show that 

e~ 2ax = 0.1 for an electron 
= 10 ~ 43 for a proton 



Problem 13.17 

A particle of total energy E travels in a positive x direction in a region where the potential energy 
V = 0. The potential suddenly drops to a very large negative value. Show that, quantum 
mechanically, the amplitude of the reflected wave tends to unity and that of the transmitted wave to 
zero. Note that this implies non-classical total reflection. 



Problem 13.18 

Show that Schrodinger’s equation for a one dimensional simple harmonic oscillator of frequency cu is 
given by 



d 2 ip 2 m 
Jx 2+r h 2 




1 

2 



2 2 
muj x 



ip = 0 



and verify that if a 2 = mui/H then 



iPo (x) = (a/V^) l/2 z- aV/2 



and 



ipi(x) = (a/2\Zn) l ^ 2 2axe “ 2x ~/ 2 

are respectively the normalized wave functions for Eq = (zero point energy) and E\ = 2 Tuo. 

Problem 13.19 

The normalized wave function for a one-dimensional harmonic oscillator with energy 
E n = (n + i)£w is 



ip„ = N„H„(ax)e a ~ x 2/2 , 



N„ = (a/n l/2 2 n n\) l/z 
a 2 = mui/h 



where 
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and 

Verify that ipo(x) and ipi(x) of Problem 13.18 satisfy the expression for i p„ and calculate ^ 2 ( 1 ) and 
ipz(x). 

Problem 13.20 

Davisson and Germer (1927) fired electrons with an energy of 54 eV at a nickel crystal which had an 
atomic plane separation of 0.91 x 10“ 10 m (0.91 A). Bragg reflection gave a diffraction maximum at 
65°. Calculate the reflected electron momentum p and the kinetic energy to show that the difference 
between the incident and scattered kinetic energies was within 3.9%. 



Problem 13.21 

The perturbed energies of v| / (odd) and v| / (even) due to electron-ion interactions are given by 



±E jyvvtMv 

J ip*ipdx 



where ip* is the complex conjugate of ip 



If the zero of energy is taken as the mean value of the potential then the potential may be written as 
a Fourier series in the form 



00 

V = — V„ cos l^nx/a 

n— 1 



where the V„ are the potential gaps in Figure 13.16. They are positive numbers for a potential with 
strong negative peaks at the lattice sites. For travelling waves ip = e ±,kx so ip*ip = 1, which gives 
AE = 0 in the above expression except for ip = sinfct or cos kx when k = n-n/a where a is the 
periodicity of the lattice. 

Show that for ip = sin ka 



ae = -J2 



J sin 2 kxV n cos dr 
J sin 2 kxdx 



= — V„ for k = im/a 



Show that ip = cos kx in the above expression gives AE = —\V n for k = nn/a 



Summary of Important Results 

De Broglie Wavelength A = h/p 



Heisenberg’s Uncertainty Principle (Bandwidth Theorem) 

Ax A p h 

AE At « h 



determines zero point energy. 
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Schrodinger’s time independent wave equation 

d 2 ^(x) 2 m(E — V) 

ip(x) =Ae ikx + Be~ ikx , 



where 



where 



c- = lm(E : v) e> v 

h- 

ip(x) = C e ax + De~ ax , 



a 



2m(V - E ) 

¥ 



V > E 



Probability per unit length of finding a particle at x 

P{x) = | ?/>(*) 1 2 

Normalization 



\tp(xyz)\ dxdvd-;=l 



all space 



Harmonic oscillator 
Energy levels E„ = (. n + ^)hv 
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Non-linear Oscillations and Chaos 



The oscillations discussed in this book so far have all been restricted in amplitude to those 
which satisfy the equation of motion where the restoring force is a linear function of the 
displacement. This restriction was emphasized in Chapter 1 and from time to time its 
limiting influence has required further discussion; for example, in Chapter 6 on acoustic 
waves in a fluid. We now discuss some of the consequences when this restriction is lifted. 

We begin with simple examples in mechanical, solid state and electrical oscillators. 
More complicated behaviour associated with chaos in these oscillators is also examined 
together with the appearance of chaos in biological and fluid mechanical systems. 



Free Vibrations of an Anharmonic Oscillator - Large Amplitude 
Motion of a Simple Pendulum 



In Figure 1.1 the equation of motion of the simple pendulum was written in terms of its 
angular displacement as 



d 2 e 

dr 2 



+ lu 2 0 9 = 0 



where u > q = g/I. Here, an approximation was made by writing 9 for sin 9: the equation is 
valid for oscillation amplitudes within this limit. When 9>1° however, this validity is lost 
and we must consider the more complicated equation 



d 2 6» 

At 2 



+ ujq sin 9 — 0 



Multiplying this equation by 2A9/dt and integrating with respect to t gives (d9/dt ) 2 = 
2cuqCos#+A, where A is the constant of integration. The velocity d9/dt is zero at the 
maximum angular displacement 9 = 9 o, giving A = —2ic ( 2 cos 9a so that 



d 9 
d t 



tuo[2(cos 9 — cos 0 q)] 1//2 
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or, upon integrating. 



(jjQt — 



dO 



{2[cos 6 — cos #o]} 



If 6 — 0 at time t = 0 and T is the new period of oscillation, then 6 = 8q at t — T/4, and 
using half-angles we obtain 



W0 4 = 



dd 



o 2[sin“ 6q/2 — sin~ 0/2] 



2 Q/r) 1 1/2 



If we now express 0 as a fraction of 8 o by writing sin 8/2 = sin(#o/2) sin^>, where, of 
course, — 1 < sin (f> < 1 , we have 



^(cosd/2)68 = (sin0o/2)cos<^><5^> 



giving 



7T T 
2To 

where To = Itt/loq. 

Expansion and integration gives 



'/ 2 dcj) 

o [1 — (sin 2 #o/2) sin 2 <fi\ 



T = T 0 (l + \ sin 2 8q/2 + ^ sin 4 8 0 /2 + 



or approximately 

T= 7o(l +isin 2 0 o /2) 



(Problem 14.1) 

Forced Oscillations - Non-linear Restoring Force 

When an oscillating force is driving an undamped oscillator the equation of motion for 
such a system is given by 



mix + s(x) = F () cos c ot 

where s(x) is a non-linear function of x, which may be expressed in polynomial form: 

s(x) — S\X+ S 2 X 2 + S 3X 3 . . . 

where the coefficients are constant. In many practical examples s(x) = s i x + s 3X 3 , where 
the cubic term ensures that the restoring force s(x) has the same value for positive and 
negative displacements, so that the vibrations are symmetric about x = 0. When ,v 1 and s 3 
are both positive the restoring force for a given displacement is greater than in the linear 
case and, if supplied by a spring, this case defines the spring as ‘hard’. If s 3 is negative the 
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Figure 14.1 Oscillator displacement versus restoring force for (a) linear restoring force, (b) non- 
linear 'hard' spring, and (c) non-linear 'soft' spring 



restoring force is less than in the linear case and the spring is ‘soft’. In Figure 14.1 the 
variation of restoring force is shown with displacement for 53 zero (linear), s 3 positive 
(hard) and s 3 negative (soft). We see therefore that the large amplitude vibrations of the 
pendulum of the previous section are soft-spring controlled because 

sin (9 « 0 — ^(9 3 

Figure 14.2 shows a mass m attached to points D and D\ a vertical distance 2 a apart, by 
two light elastic strings of constant stiffness .v and subjected to a horizontal driving force 
F 0 cos cot. At zero displacement the tension in the strings is To and at a displacement x (not 
limited in value) the tension is T = To + s(L — a) where L is the stretched string length. 

The equation of motion (neglecting gravity) is 

mx = —2 T sin 9 + F 0 cos uit 

— — 2[Tq + s(L — «)]- + Fq cos cot 



D 




D' 



Figure 14.2 A mass m supported by elastic strings between two points D and D' vertically 
separated by a distance 2 a and subjected to a Lateral force F 0 cos u>t 
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Inserting the value 



L — a 



1/2 



and expanding this expression in powers of x/a, we obtain by neglecting terms smaller than 

(. x/a ) 3 



2 T 0 (sa-T 0 ) 3 



mx = x — ■ 

a 



x + F o cos Lot 



which we may write 



where 



3H- s\x + s 3 x 



3 



^0 

= — cos Utt 



m 



2 r 0 sa-T 0 

= and ^ 3 = r — 

ma ~ ma i 



If s 3 is small we assume (as a first approximation) the solution x\ 
from the equation of motion 



A cos cut, which yields 



O "3 -*0 

X] = — SiA COSCUf — S3A COS utt -\ cos lot 



Since cos 3 utt — | cos utt + \ cos 3 uit, this becomes 

xi = — («iA + |i 3 A 3 — Fo/m ) cosuit — -[s^A 3 cos 3 utt 

Integrating twice, where the constants become zero from initial boundary conditions, gives 
as a second approximation to the equation 



the solution 



X + S\X + s 3 X 



3 



F 0 

= — cos utt 



m 



X2 



1 / , 3 o F 0 \ j 3 A 3 

ijA + -.V 3 A coscut + — — ^cos3t at 

\ 4 m J 36cu- 



Thus, for 53 small we have a value of c o appropriate to a given amplitude A, and we can plot 
a graph of amplitude versus driving frequency. Note that we have a third harmonic. We see 
that for a system with a non-linear restoring force resonance does not exist in the same way 
as in the linear case. In the example above, even when no damping is present, the amplitude 
will not increase without limit for a driving force of a given frequency, for if u is the natural 
frequency at low amplitude it is no longer the natural frequency at high amplitude. For s 3 
positive (hard spring) the natural frequency increases with increasing amplitude and the 
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Figure 14.3 Response curves of amplitude versus frequency for oscillators having (a) a 'hard' spring 
restoring force, and (b) a 'soft' spring restoring force. In the extreme case (c) the tilt of the maximum 
is sufficient to allow multi-valued amplitudes at a given frequency and 'shock jumps' may occur (See 
Figure 15.1 for comparable behaviour in a high amplitude sound wave.) 



amplitude versus frequency curve has a tilted maximum (Figure 14.3a). For a soft spring, 
.v 3 is negative and the behaviour follows Figure 14.3b. It is possible for the tilt to become so 
pronounced (Figure 14.3c) that the amplitude is not single valued for a given u> and shock 
jumps in amplitude may occur at a given frequency (see the next chapter on the 
development of a shock front in a high amplitude acoustic wave). 



(Problems 14.2, 14.3) 

Thermal Expansion of a Crystal 

Chapter 1 showed that the curve of potential energy versus displacement for a linear 
oscillator was parabolic. Small departures from this curve are consistent with anharmonic 
oscillations. Consider the potential energy curve for a pair of neighbouring ions of opposite 
charge ±r in a crystal lattice such as that of KC1. If r is the separation of the ions the 
mutual potential energy is given by 



V(r) 



ae 2 (3 

1 

r rP 



where a and (3 are positive constants and p = 9. This is plotted in Figure 14.4, which shows 
that the potential energy curve is no longer parabolic. The first term of V(r) is the energy 
due to Coulomb attraction; the second is that of a repulsive force. The value of a depends 
upon the presence of neighbouring ions and is about 0.3. The constant (3 can be found in 
terms of a and the equilibrium separation ro because, in equilibrium. 




p/3 



-P + 1 



= 0 



dV 

dr 



r=r o 
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Figure 14.4 Non-parabolic curve of mutual potential energy between oppositely charged ions in 
the lattice of an ionic crystal (NaCl or KCl). The combination of repulsive and attractive forces yields 
an equilibrium separation r 0 . Very small energy increments give harmonic motion about r 0 but 
osciLlations at higher energies are anharmonic, leading to thermal expansion of the crystal 



giving 



P 



ae 2 r 



p - 1 

o 



P 



X-ray diffraction from such crystals gives r () = 3. 12 A for KCl, so that 6 may be found 
numerically. 

To consider small displacements from the equilibrium value ro let us expand V ( r) about 
r — r o in a Taylor series to give 



V(r) 



V(r o) +•* 




r o 



2 ! 




3! 




r o 



where x = r — ro. Since (dV/d r) r — 0, we may write 

X 2 fir 3 

V(r)-V(r 0 ) = V(x)=A- + — 

The quantity Ax 2 / 2 is the quadratic term familiar in the linear oscillator, so that for very 
small disturbances the bottom of the potential energy curve is parabolic, and a small gain in 
energy causes the ion pair to oscillate symmetrically about r — ro. An increase in the ion 
pair energy involves the second term Bx 2 / 6, and oscillations are no longer symmetric 
about ro, because \i ~2 — /'o| > |r [ — r o | in Figure 14.4. Hence the time average for r — ro is 
not zero as it is for a linear oscillator, and this time average r, > ro. If all ion pairs acquire 
this amount of energy, for example by heating, the crystal expands. We may consider the 
force between the two ions as 



F 



dV 

dx 
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and note that the quadratic term here is responsible for the lack of symmetry in the motion. 
If it were a cubic term as in the previous example the symmetry of motion about r o would 
still occur. The coefficient A in the force equation is the force constant in the discussion on 
crystals in Chapters 5 and 6 and leads directly to Young’s modulus. The coefficient B gives 
information on the coefficient of thermal expansion of the crystal. 

(Problems 14.4, 14.5) 

Non-linear Effects in Electrical Devices 

A feature of the non-linearity in the mechanical devices discussed earlier was the 
introduction of harmonics of the fundamental frequency of the driving force. It is 
comparatively simple to avoid these effects of non-linearity in electronic systems by 
choosing a small linear portion of the operating characteristic and amplifying the response 
in stages. In an electromechanical device such as a piezoelectric crystal linearity is again 
achieved by restricting all oscillations to small amplitudes and amplifying the response. In 
electroacoustic devices such as microphones and loudspeakers the introduction of 
harmonics often leads to severe distortion. In the loudspeaker of Figure 14.5 even if a 
pure sinusoidal wave is delivered to the speech coil it is difficult to provide a mechanical 
suspension for the cone which has a linear response. The cone acts as a piston radiating 
acoustic power, and limitation of amplitude together with inevitable mismatching of 
acoustic impedances reduces the efficiency of transforming electrical into acoustic power 
to less than 10%. Fortunately the ear is a sensitive device. 

Non-linear electrical oscillators are, however, often used, and Figure 14.6a shows a 
‘relaxation oscillator’ circuit where a capacitance is discharged very rapidly through a 
gaseous conductor such as a hydrogen tube. E is the constant charging potential and i is the 
instantaneous value of the current which charges the capacitor through the resistor R to a 
potential V s , the striking potential, at which the gas in the tube is ionized. The tube 



Driving force F 

a current in coil x magnetic field in gap 




Figure 14.5 A pure sinusoidal wave input to an eLectroacoustical device such as a loudspeaker wilL 
Lead to distorted sound output if the cone suspension has a non-linear stiffness at high amplitudes 
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Figure 14.6 Electrical circuit of a non-linear 'relaxation oscillator'. A capacitance C is charged 
through a resistance R to a potential l/ s < E, at which the gas-filled valve strikes and rapidly 
discharges the condenser to an extinction potential l/ e , when the valve ceases to conduct and the 
cycle is repeated 



becomes highly conducting and discharges the capacitance in a negligibly short time to V e , 
the extinction potential, at which the tube ceases to conduct. The capacitance charges again 
to V s and the cycle is repeated. The variation of voltage across the capacitance with time is 
shown in Figure 14.6b. Assume that at point A and time t the capacitance has just 
discharged. If current / o is flowing at time t = 0 then 

V e = E - i 0 Re~ t/RC 

The capacitance charges to the potential V s in a time r so that 



giving 



giving 



V s = E ~ i 0 Re~ {,+T)/RC 

V s -V e = i 0 R (e ~‘l RC _ e -h+T)/*C) 
= i 0 Re~ t/RC [ 1 - e~ T/RC ] 

= (E — V e ) [1 — e,~ T / RC ] 

- t/RC _ E - V s 

E - Fe 



or 




E~ 

E-Vj 



t = RC 
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The period of oscillation is therefore directly proportional to the charging time constant RC. 

A more sophisticated circuit produces a linear charging system with a very short 
discharge time so that the exponential voltage output becomes linear and gives a ‘sawtooth’ 
waveform. From Chapter 10 we know that this periodic function contains many harmonics. 
A sawtooth voltage output applied to the time base of an oscilloscope produces a linear 
sweep of the spot across the tube. 



Electrical Relaxation Oscillators 

Van der PoL and Chaos (1926-1927) 

The work of Van der Pol continues to attract the attention of research workers in chaos 
chiefly because of an equation he derived at that time. His relaxation oscillator was a 
multivibrator, a two stage resistance-capacity coupled amplifier with the output of the 
second triode fed back as input to the grid of the first. His analysis used the mechanical 
form of the damped simple harmonic equation with a negative resistance term which 
increased the amplitude, thus 

x — ax + uj 2 x = 0 

with a solution 

x = Ce +Crt / 2 sin [(tu 2 - a 2 /4)t + <£>] 

for a > 0 and a 2 /4 < to 2 . 

He restricted the unlimited growth of x by replacing a with a — 3y/x 2 where 7 is a 
constant, writing cot = t' and x = (a/ 37 ) t0 give his equation the form 

v — e(l — v 2 )v + v = 0 

where e = a/u> and v = du/d t' . 

It is this equation with a forcing term A sincuo? on the right hand side which is known as 
Van der Pol’s equation and which has formed the basis of a number of studies in chaos, one 
of which we shall meet later. Van der Pol found that as e increased his oscillator gradually 
assumed the period r = RC with the output for e = 1 0 shown in Figure 14.7 (Van der Pol, 
1926). 

Even more interesting from the viewpoint of chaos was the oscillator by which he could 
produce subharmonics of its natural frequency. Such a phenomenon, period doubling, 
tripling, etc. is now recognized as an early sign of chaos, indeed Li and Yorke (1975) have 
published a paper entitled ‘Period 3 implies Chaos’. 

Van der Pol's period doubling circuit is shown in Figure 14.8. With E 0 = 0 and 
C = 1 0 3 pF the relaxation frequency of the system was 10 3 cycles. Setting £’ () sin cot at 
7.5 sin 2x107 he was able, by increasing C through the range 5-40 x 1 0 3 pF to produce 
subharmonics lu/2,lu/3 . . . tu/40. . . u/200. He registered the output on a pair of loosely 
coupled telephone earpieces and his paper makes the interesting comment that ‘often an 
irregular noise is heard in the telephone receivers before the frequency jumps, however this 
is a subsidiary phenomenon'. In fact, such internally generated noise accompanied by 
subharmonics is one of the early signs of chaos (Van der Pol and Van der Mark, 1927). 
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Chaos in Population Biology 

Chronological accounts of a modern research topic rarely present the most coherent 
picture. The significance of early developments is not always recognized until much later; 
indeed the first recorded strange or chaotic attractor, that of Lorenz in 1963, comes at the 
end of this account but only because of its level of sophistication. Even the simple example 
with which we begin was not fully explained when it first appeared. 

Despite its simplicity the example of population biology reveals many of the 
characteristics displayed by chaotic systems. These are: 

• The chaos is deterministic and not random; that is, the paths followed by trajectories are 
governed by solutions to given non-linear equations. 

• Trajectories from closely neighbouring starting points diverge with time. 

• Trajectories can, according to the conditions, finish on a stable point attractor, they can 
diverge to infinity from a repellor or at some stage they can orbit in what is known as a 
limit cycle. 

• Such a limit cycle can develop an infinite series of period doubling; odd number periods 
may be generated, also completely aperiodic trajectories which still remain within a 
bounded region of space. 

• With the appearance of chaotic motion the sharp definition of these frequencies is 
gradually overcome by a growing background of wide band noise which is internally 
generated. 

A number of equations dealing with population biology has been widely studied but we 
consider the simplest, a quadratic equation discussed by May (1976) in a classic review. 
This is known as the logistic map and is given by 

Xn+l = 4Ax„(l -x n ) 

where the subscripts refer to the year in which the population was measured and A is a 
parameter. Restricting the values of x and A to 0 < x < 1 and 0 < A < 1 is a scaling device 
which keeps the dynamics within the limits of a diagram. Because it involves only the 
coordinate x this logistic equation is known as a one-dimensional map. 

Much of the behaviour of populations under this quadratic rule is shown by the 
interaction of the parabola and the straight line bisector x n+ \ — x„ of gradient unity and 
this behaviour is divided into three distinct categories by the A ranges 0< A<±,±< A<§ 
and | < A < 1 . 

To illustrate the general use of the bisector consider what happens to a population with a 
constant reproduction rate; that is, the straight line x n+ \ = AXx n . Figure 14.9a shows the 
line for A > | compared with the bisector x n+ \ — x n . Taking xq as the starting value of the 
population gives x\ on the A >5 line which then projects horizontally to the same value 
(x\ ) on the bisector. This gives the value x\ on the base line which projects vertically to 
the A > | line to give xo and the process is repeated. Evidently for A >1 the population 
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Figure 14.9 Change of population with constant reproduction rate given by x„ +1 = 4Ax„. (a) for 
A > | the population — > oo as the trajectories move away from the origin (a repellor). (b) For A < | 
the population is extinguished, all trajectories moving to the stable point attractor at zero. The 
initial population atx 0 gives Xi on the A > 7 line which projects horizontally to the same value on 
the bisector x n+1 =x n . The value Xi projects vertically to x 2 on the A > \ line and the process 
repeats itself. Similarly for A < | 
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increases without limit, the trajectories move to infinity from a repellor. For x < Fig- 
ure 14.9b, the same process of horizontal and vertical projection produces x\ < xo and the 
population is extinguished, all trajectories moving to the stable point attractor at zero. 

The method is equally applicable to the parabola 

X n +1 = 4Ax„(l -x„) 

For A > \ we have Figure 14.10 and where the curve and the bisector intersect we have 
x n +i — x n corresponding to a fixed point in the iteration process. Writing this value as 
x n+ 1 = x n = x* we find from x* = 4Ax*(l — x*) the two roots x* = 0 and x* — 1 — ^ 
each of which is a fixed point. 

Restricting x and A to the values between 0 and 1 gives for A < | only the value x* = 0 
but for i < A < l,x* may take both values. If x* is stable; that is, a fixed point to which 
the end points of all trajectories become infinitely close, Figure 14.10, it is a point attractor 
and this stability depends on the slope of the curve at x*. We write x n +\ = 
4Ax„(l — x n ) — f(x n ) and if —1 </'(x) < 1 at x*, x* is stable. When the slope /'(x) 
equals —1 stability is lost and x* bifurcates into two new values, each of which is stable. 
This is called a pitchfork bifurcation and is the origin of the period doubling sequence in 
the logistic map. Odd numbered periodic cycles arise at a later stage from bifurcations into 
pairs of new values, only one of each pair being stable. These are called tangent 
bifurcations. 




Figure 14.10 The Logistic equation x n+1 = 4Ax„(l -x„) cut by the bisector x„ +1 =x n at the 
points x* = 0 and x* = 1 — When | < A < | the Latter vaLue of x* is a stabLe point attractor for 
aLL trajectories as shown 
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The dependence of stability upon /'(x) at the fixed point x* follows from Taylor's 
theorem for, with x „ , i — f(x n ) and x„ = x* + s„ where e n is a very small quantity, we have 

X n + 1 =f(x* + £„) ~f(x*) + £ n f'(x*) 

= x*+e n f(x*) 



because x* — f(x* ) at this fixed point x*. 

Now x n+ \ = x* + e n +b giving f(x*) — e„ + i/e„ and for n — > oo, £ n+ \ — > 0 only if 
-1 <f(x) < 1. 

Thus, x* = 0 is a stable point attractor for all trajectories when A < \ but becomes 
unstable at A = \ while x* = 1 — is a stable point attractor for all trajectories when 
\ < A < |. At A — | the slope of /(x) at x* = 1 — ^ equals —1, stability is lost, x* 
bifurcates and a stable oscillation between two new values x* and x* develops. We can see 
this by studying the behaviour of x n+ 2 versus x„, obtained by a double application of the 
logistic equation. 

We can express x „ + 2 = /(x„+ 1 ) — ff(x n ) = / 2 (x„) where the superscript defines the 
double application. A graph of / 2 (x), which is symmetric, is shown in Figure 14.11a where 
the central minimum decreases as A increases. The bisector is now of course x „ + 2 = x„ 
and, as shown, it cuts/ 2 (x) at three fixed points. The value of A is chosen so that x* is near 
the minimum and x* is near a maximum. The slope of/ 2 (x) (written / 2, (x)) at x* and x* 
is therefore close to zero and x* and x* are stable fixed points of/ 2 (x). It is at this value of 
A = | that period doubling begins. 

The third fixed point x* is clearly the original fixed point of /(x). This follows from 
noting that the point x* — x„ = x n+ \ =x n+ 2 falls on both /(x) and f 2 (x) and on their 
respective bisectors. In addition, the stability behaviour of x* is the same for /(x) and 
/ 2 (x). We can show this via the chain rule, for if 

X 2 =f(x t) —f 2 (xo) where xq =/(x 0 ) 



then 



f 2 \x) =f(x 0 



d[/(xi)]dxi 
dr i dx 



dfjx i) 

dxi 



fix) 



where all derivatives are evaluated at x = xo- This result holds for higher values of the 
superscript n in /"(x). 

Taking xq as the fixed point x* then 



x 



* 



X 0 = Xi 



X 2 



and 



f 2, (x*) =f(x*)f(x*) = (f\x*)) 2 . 



Thus, if x* is stable (unstable) in/(x) then it is stable (unstable) in/ 2 (x). 
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The stable fixed points x* and x* of f 2 (x) for A > | are not fixed points of f(x). Clearly, 
since these points lie on the bisector x „ + 2 = x n , each will return to itself every second 
iteration. This can occur only when the expressions 

x* =/(x*) and x* =/(•**) 

jointly hold so a trajectory ends in the cycle x* — >x* — — >x*, Figure 14.11(b). 

(Problem 14.6) 

In the same way that x * and x* became the two stable points at A i = | they will become 
simultaneously unstable for some larger value A 2 when f 2 '(x'*") — —1. At A 2 , xf and x* 
will each bifurcate to two stable points to give a stable 4-cycle period based on the stable 
fixed points of/ 4 (x). As the period doubling sequence continues, via pitchfork bifurcations, 
the values A 1 , A 2 , A 3 , A 4 . . . for the cycles 2, 2 2 , 2 3 , 2" . . . converge geometrically and 
Feigenbaum (1978) found that for this period doubling sequence the limit as n — > 00 is 
given by 

<^00 = , An+1 ~ A ” = 4.6692016 

This result appears to be verified not only for the logistic map but for other non-linear 
equations with a single maximum and many experiments, computer simulated and 
otherwise, support Feigenbaum's result. 

The value of A at which the cycle 2 ”(n — » 00 ) is approached is given by Aoo = 0.8925. 
This is illustrated in Figure 14.12 where the successive bifurcations of 2" cycles become 

1.0 
0.9 
0.8 
0.7 
0.6 
0.5 
0.4 
0.3 
0.2 
0.1 
0.0 

0.75 1.0 

Figure 14.12 Bifurcations at period doubling for the Logistic map begin at A = | and reach the 
limit 2 °° at A^. Between A^ and A = 1 chaotic behaviour is interspersed with regions or windows 
at which odd numbered cycles of period k and their harmonics k2" appear. Some cycles are aperiodic 
(Figure 14 . 13 ). (From Tabor, 1989 ) 
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Figure 14.13 An aperiodic cycle which remains bounded within the system for | < A < 1 



increasingly compressed in the A space. Between the values of Aoo and A = I a very rich 
behaviour is observed; there is an infinite number of different periodicities and an 
uncountable number of very long cycles of no measurable period but which remain 
bounded within the system (Figure 14.13). 

The order in which these cycles appear has been successfully predicted by Metropolis et 
al. (1973). The first odd cycle appears at A = 0.9196 and the first period 3 cycle appears at 
A = 0.9571. This is an important cycle because of the paper by Li and Yorke entitled 
‘Period 3 implies Chaos’ (Li and Yorke, 1975). 

We can examine the origin of the first period 3 cycle in Figure 14.14(a). At some value 
A* the bisector x,j + 3 = x„ is tangent to the curve x n+ j, — f(x„) at the three fixed points xf, 
x*, x * . The slope of f 3 (x„) at these points must equal +1 and each of these three unstable 
fixed points bifurcates into a pair of which one is stable and the other is unstable. This is the 
tangent bifurcation. The period 3 cycle orbits between the three stable fixed points (one 
from each bifurcation) and we can follow the bifurcation process by increasing A beyond 
A* by a small quantity. This heightens the maxima and deepens the minima so that the 
bisector now cuts / 3 (x„) in pairs of points one on each side of the tangent position. A 
typical pair is shown in Figure 14.14(b) on a magnified scale. The tangent point T 
splits into points A and B each of which moves along the curve from T as A increases. 
Point A moves from a gradient position of +1 around the curve maximum to a 
gradient position of less than 1 and forms the stable fixed point of the bifurcated pair. Point 
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Figure 14.14 (a) The first period 3 cycle appears at A = 0.9571. Just below this value of A the 

bisector x n+3 = x„ is tangent to x„ +3 = f 3 (x„) at three unstable fixed points (gradient = +1). A 
small increase of A splits these points into pairs, one point of each pair becoming stable, (b) 
Magnification at tangent point T which splits into a pair A and B with a small increase in A. At T the 
gradient is +1 (unstable), A is stable at a reduced gradient and B is unstabLe at an increased gradient 
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Figure 14.15 The one-dimensional logistic equation x„ +1 = 4Ax„(l — x n ) is non-invertible 
because trajectories cannot be traced uniquely backwards to their origins. Each x„ +1 can arise from 
two different values of x n 



B moves from T (gradient +1) along the curve to a steeper gradient position and remains 
unstable. 

Thus, to quote May, ‘the fundamental stable dynamical units are of basic period k which 
arise by tangent bifurcation along with their associated cascade of harmonics of periods 
k2" which arise by pitchfork bifurcation. The hierarchy of stable cycles of period 2" 
(namely, k = 1) is merely a special case albeit a conspicuously important one’. 

The one dimensional logistic map has one profound limitation. Figure 14.15 shows that 
it is symmetric about the point x = \ so that any x n+ \ can arise from one of two different 
values x n and x' n . This fails an essential requirement in chaos theory, namely that all 
trajectories may be traced uniquely backwards in time to their origins. This property is 
known as ‘invertibility’ and clearly the logistic map is non-invertible. 



Chaos in a Non-linear Electrical Oscillator 

The development of the varactor has made it possible to display many features of the 
preceding section on a cathode ray oscilloscope in a first year university laboratory 
experiment. The varactor acts as a diode in the forward direction but behaves in the reverse 
direction as a variable non-linear capacitance in a series LCR circuit. Testa et al. (1982) 
confirmed not only many of the results above but, in addition, supported two predictions 
made by Feigenbaum (1979). These were 

1. That bifurcation at period doubling follows a distinct procedure — as a 2" cycle loses 
stability after 2" iterations, a point of the attractor just misses duplicating itself with 
duplication occurring only after another 2” iterations. Thus each element of the cycle 
splits into closely spaced pairs with 2" iterations required to visit an element from its 





Figure 14.16 In the period doubling process the separation of adjacent elements in a pair is 
reduced by a universal factor a from one bifurcation to the next. For period doubling between 16 and 
32 a = a/b = 2.35 and a = c/a = 2.61. Reproduced by permission of The American Physical Society 
from Testa et al. (1982) 



adjacent neighbour. From one bifurcation to the next, separation of adjacent elements in 
a pair is reduced by a universal factor a. = 2.5029 (Figure 14.16). 



2. After a spectral component in the period doubling process has been generated, its 
amplitude remains approximately constant during further bifurcation and each new 
subharmonic of this frequency can be predicted as having its amplitude reduced by 
101og 10 /i where 



M = 




1/2 



6.57 



A typical varactor LCR circuit is shown in Figure 14.17 with the non-linear capacitance 
given by 

C(V) = Co/(l + V c /0 y 

where V c is the voltage across the varactor. In Testa’s experiment Co = 300 pF, 
(3 = 0.6,7 — 0.5, L — lOmH and R = 2812. For low values of Vo this gave a high Q 
resonance circuit at a frequency of 93 kHz. With /fixed near the resonance frequency in the 
driving voltage Vo sin 2 tt/7, Vo was varied and the varactor voltage V c ( t) was measured. 
Testa et al. assumed that Vo played the role of A in the logistic equation and that V c 
corresponded to x. A real time display on a double beam CRO of V c (t) and Vo(f) clearly 
revealed threshold values of Vo„ for bifurcations into subharmonics f/n where 
n = 2,4, 8, 16. At n = 4 (not shown by Testa) this would appear as Figure 14.18. 

Figure 14.19 was obtained on the oscilloscope screen by Testa with a slow horizontal 
scan of Vo versus the varactor voltage V c which was magnified in selected steps of 10 mV. 
The numbers on the horizontal axis indicate the generation of particular periods and 
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Figure 14.17 Non-linear LCR series circuit where the non-linear element is the varactor C which 
acts as a diode in the forward direction but becomes a variable non-linear capacitance in the reverse 
direction 

bifurcations are clearly visible. The threshold values of Vo for these periods are shown in 
Table 14.1. The first four threshold values Vo,, gave 

<5, = V()2 ~ Vo1 =4.257 ±0.1 

^03 ~ ^02 




Figure 14.18 Double beam oscilloscope showing driving voltage l/ 0 (f) at frequency/ and varactor 
voltage V c (t) at frequency //4. Values of V 0n for appearance of f/n are given in Table 14.1 
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I.O.V. 

Figure 14.19 Slow horizontal scan of V 0 versus V c . The numbers on the horizontal axis indicate the 
generation of particular periods. Bifurcations are clearly visible. Threshold values of V 0 for various 
periods are shown in Table 14.1. Reproduced by permission of The American Physical Society from 
Testa et al. (1982) 



Table 14.1 Table of periods and the threshold values V 0 at which they appear 



Threshold Vq 

Period (rms volts) 



comments 



2 


0.639 




4 


1.567 




8 


1.785 


1 Threshold for periodic bifurcation 


16 


1.836 




32 


1.853 




Chaos 


1.856 


Onset of noise 


12 


1.901 


i Window 


24 


1.902 ; 


6 


2.073 


l 


1 Window 


12 


2.074 


5 


2.353 


Window 


10 


2.363 ' 


7 


2.693 


1 


| Window 


14 


2.696 


3 


3.081 1 




6 


3.338 




| Wide Window 


12 


3.711 




24 


3.821 1 




9 


4.145 


l 


18 


I Window 


4.154 J 



Reproduced by pemission of the American Physical Society from Testa et al. (1982) 
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and 

S 2 = Vm ~ V ° 2 = 4.275 ±0.1 
V04 — ^03 

in the Feigenbaum convergence series. 

To test the first of Eigenbaum’s predictions the values of c, a and b in Figure 14.16 were 
measured for the bifurcations between periods 16 and 32. These gave 

a = y = 2.35 and a — -=2.61 
b a 

As periods doubled the power reduction in their frequency components was measured and 
the results were consistent with Feigenbaum's analysis. 

Phase Space 

One of the most vital concepts in the description of chaos is that of phase space. In one 
dimension, e.g. the logistic equation, trajectories can be followed without introducing it. In 
higher dimensions it is essential. 

The idea of phase space has many applications in physical sciences. Students meet it 
initially in the Maxwell-Boltzmann statistical distribution where the question is asked: 
‘Given N gas particles at a temperature T occupying a volume V, what fraction of N will be 
found in the velocity range v to v + dv in the small volume range dVT We shall discuss this 
application to statistical distributions in an appendix at the end of the book. 

The number of dimensions of phase space is determined by the number of coordinates 
required to define the complete physical state of the system. For each gas particle above we 
need six dimensions, three for the v x , v y , v z components in velocity space and three for the 
x y z components in the configuration space V. 

Each point in phase space defines the complete physical state of the system (here a gas 
particle) and trajectories in phase space follow the physical development of the system. 

When the energy of an ensemble of systems (particles) is conserved the phase space or 
volume associated with them remains constant, but if any energy is dissipated the phase 
volume contracts. This contraction generates a sub-space, there is a reduction in the 
number of coordinates required and their range is reduced. 

Figures 14.20-14.23 show, in turn, the two dimensional phase space diagrams of 
different oscillators using the coordinates x and *. 

1. A linear simple harmonic oscillator (Figure 14.20). 

2. a damped simple harmonic oscillator (Figure 14.21). 

3. an undamped non-linear oscillator formed by a pendulum supported on a light rigid rod 
(Figure 14.22) 

4. (a) an undamped oscillator with a potential energy 

V = - \ax 2 +\bx A 
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Figure 14.20 Linear simple harmonic oscilLator represented in the two dimensional phase space of 
x and x. Each ellipse corresponds to a curve of constant energy and encloses a constant area of phase 
space 



x 




Figure 14.21 The energy loss per cycle in a damped simple harmonic oscillator is shown in its 
phase space diagram as a reduction of area with each cycle as its trajectory spirals to a stable point 
attractor at the origin 
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Figure 14.22 Phase portraits for a non-linear pendulum on a light rod. The closed curves represent 
energy values up to the limit 9 = 0 at pendulum amplitude 9 = ±tv(9=0 is the hanging rest 
position). The open curves represent fast rotations with energy values large enough for 
9 > 0 at 9 = (2/7 + l)7r 



(b) the oscillator of 4(a) now lightly damped (Figure 14.23). 

The features of each will now be described, introducing ideas which are frequently met in 

chaotic systems. 

1. The trajectory in xx phase space for a simple harmonic oscillator of constant energy is 
an ellipse of constant area. Its potential energy curve is the familiar parabola of p. 10. 

2. For a lightly damped simple harmonic oscillator where energy is dissipated the phase 
space is an inward spiral on to the equilibrium zero position which is a stable point 
attractor. As energy is lost each orbit of the spiral encloses a smaller element of phase 
space than its predecessor, unlike (1). 

3. Here we plot the phase portraits for a large range of pendulum energies E. The closed 
curves represent those energy values up to the limit where the pendulum (rigid rod) 
stands on its head with zero velocity and angular amplitude 9 = ±7r measured from the 
hanging rest position. Higher E values have open curves because their rotations are fast 
enough to pass through the values of 9 = (2n + \)ir with velocities 9 > 9. The largest 
closed curve has pointed ends, at maximum amplitude 9, because 9 is small for changes 
of 9 in that range. Each interval of 2-7 t along the horizontal axis represents a complete 
rotation. 

The curves passing through 9 = ±7r evidently separate those energies capable of 
allowing complete rotations from those which cannot. Such a curve is called a 
separatrix and the points 9 = ±7r are called saddle points. 
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Figure 14.23 Potential energy curve V = - \ax 2 + |bx 4 with phase portraits for the damped and 
undamped oscilLators. For the undamped oscillator energies V(x) < 0 restrict the motion to that 
potential well containing the xx starting position, (a) When the starting position is on the curve 
V(x) > 0 the trajectories cross the potential barrier repeatedly, (b) For the damped oscillator 
trajectories from a given range of xx starting positions will finish at the bottom of a particular 
potential well (indicated by the shaded region known as the basin of attraction). The other basin of 
attraction is unshaded. Reproduced by permission of John Wiley & Sons from Thompson and Stewart 
(1986) 

4. The potential energy curve V = —\ax 2 +^bx 4 is drawn together with the phase 
portraits for the undamped and damped oscillators. For the undamped oscillator any 
starting position with total energy less than V(x) =0 restricts the motion to one or other 
of the potential wells. For any starting position greater than V(x) =0 the motion may 
cross the potential barrier repeatedly. The trajectory associated with motion starting 
from rest at any of the three V{x) =0 positions is the separatrix through the saddle 
point. 

If the oscillator now has a small damping term rx the final rest position is determined 
exclusively by its starting values x and x. The saddle connection is broken and the two 
equilibrium states are now competing point attractors. Starting positions of (x,x) which 
lie in the dotted regions of the phase space generate trajectories which will come to 
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Figure 14.24 Repellor and limit cycle. Phase trajectories of an osciLlator governed by the equation 
mx — rx + dx 3 + sx = 0. For x smaLl and r positive, trajectories spiral outwards from the repellor at 
the origin. For large x, the dx 3 term dominates and trajectories spiral inwards. These effects baLance 
at some boundary to form a stable limit cycle. Reproduced by permission of John Wiley & Sons from 
Thompson and Stewart (1986) 



equilibrium in the dotted attractor spiralling to rest at the minimum of the right hand 
potential well. Similarly the clear region of phase space defines the starting positions 
and trajectories which will finish at the minimum of the left hand potential well. Each of 
these two phase space regions is called a basin of attraction. 

Repellor and Limit Cycle 

To illustrate the concepts of repellor and limit cycle in two dimensional phase space we 
consider the damped non-linear oscillator governed by the equation 

mx — rx + dx 3 + sx = 0 

When x is very small we can neglect the dx 3 term and if r is positive we have negative 
damping giving outwardly spiralling trajectories from the central point which is therefore a 
repellor. For large values of x, dx ' is the dominant term and the trajectories spiral inwards. 
These competing effects are balanced at some boundary to form a steady state oscillation in 
a stable limit cycle of fixed period. Figure 14.24. 



The Torus in Three-dimensional (x,x, t) Phase Space 

Extending the ideas about phase space let us consider the generation of a torus by following 
the trajectory of a particle (or system) subject to the influence of two perpendicular circular 
simple harmonic motions of angular frequencies lo q and u> \ , where cuo traces a circle in the 
azimuthal plane with a radius r o while u> \ causes the particle to spiral on the surface of a 
torus of radius r \ (Figure 14.25). A cross section of the torus will be a circle of radius r\. 
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Figure 14.25 Torus in (xxt) phase space generated by a system subject to the influence of two per- 
pendicular circular simple harmonic motions. The trajectory of the system spirals on the torus surface 



and the particle will register some point on the circumference of the circle each time it 
passes the cross section. If u> 1 =luq this point will be identical for each period 
To = 27 t/cco- However, if u> \ u (l the particle will arrive at different points on the circle 
circumference after each interval To; for example, if lo i = 3u.'o/4, the particle will travel 
only | of the circumference for each tq and will register the points A B C D of Figure 14.26 
in that order. 

Such a cross section is called a Poincare section in phase space and is a vital tool in 
describing the multiple excursions of trajectories in phase space associated with chaos. It is 
always taken at some fixed interval of the system such as to, a typical example, as we shall 
see, is the period of the force driving an oscillator displaying chaotic motion. 

The Poincare section for a simple harmonic oscillator taken in the upper half plane 
containing the jc axis but normal to the x axis consists of only one spot at the maximum 
value of jc as the system passes through this position at intervals of To- A similar section for 
the damped oscillator will register a series of points between jc maximum and the origin as 
the trajectory spirals inwards. 



A 




Figure 14.26 When tui = 3iu 0 /4 in Figure 14.25, the system will register the points ABCD in that 
order at a given cross section. This is an example of a Poincare section in phase space 
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If the motion associated with uj \ is not circular the surface of the torus will be distorted. 
We shall see that it can be pulled out, crinkled and folded back on itself so that its Poincare 
section will assume remarkable shapes. When the repeated excursions of trajectories are 
located on such a surface it is called a manifold. The final state of such a distorted surface 
represents the reduced phase space which follows the dissipation of energy. Within this space 
is located the attractor to which the orbiting trajectories are bound. 



Chaotic Response of a Forced Non-linear Mechanical Oscillator 

Fifty years ago no engineer calculating the forced vibrations of a beam via the equation 

x + kx + x 3 = B cos t 

could have foreseen the complexity of response which computer simulated solutions have 
uncovered. Ueda (1980) has found no fewer than 21 distinct regions of behaviour using a 
range of B values (0-25) and k values (0—0.8) where the units are unspecified. Five of 
these 21 regions display chaos, the others contain a variety of different attractors. 
Thompson and Stewart (1986) have chosen particular B and k values from Ueda to 
illustrate many basic features of chaotic oscillators and the use of Poincare sections to 
identify them. Even with the same B and k values the long term behaviour of the oscillator 
is found to depend critically upon the starting values of jc and x and Figure 14.27 shows the 
phase trajectories and wave forms of five stable periodic motions around attractors for 
B = 0.2 and k = 0.08 where the letter A denotes the starting point in each case. 

We have already noted that one sign of impending chaos in a system is the divergence 
with time of phase trajectories from almost identical starting positions even though their 
behaviour is determined by the same equation. For a forced damped oscillator we saw on 
p. 58 that this behaviour consists of two terms, a transient which decays with time leaving 
the steady state component. 

One of Ueda’s chaotic regimens lies in the B range (6-8) and the k range (0.03-0.1) and 
Thompson and Stewart chose B = 7.5 and k = 0.05 for their illustration. Figure 14.28 
shows phase trajectories of the oscillator for two almost identical starting positions labelled 
A and a of (x,x). Because the vibration waveform of the oscillator is so irregular there is 
only one way of registering the passage of time on this two-dimensional phase diagram and 
that is by marking off the constant period tq associated with cos t of the driving force. This 
gives points B and b and the trajectory divergence is already evident. This divergence may 
be traced over many periods of to and is found to be exponential with time. We can 
associate the points B and b and their successors after each interval of to with the 
formation of our Poincare section of the torus on p. 489. Figure 14.29 shows the history of 
the single phase trajectory which started at A marked off in alphabetical order over the first 
nine periods of tq. Note that each letter represents a maximum of the driving force B cos t 
and that all letters fall on the right hand side of the x axis, that is x positive. 

Tracing this complicated trajectory on the three-dimensional (xxt) phase surface of the 
torus would separate that is time resolve, the apparent trajectory crossing points in the two- 
dimensional picture. If now only the Poincare section points A, B, C, D, etc. are plotted 
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Figure 14.27 Phase trajectories for the oscillator x + 0.08x + x 3 = 0.2 cost are seen to depend 
critically upon the starting values of x and x. The letter A denotes each starting position. Reproduced 
by permission of John WiLey & Sons from Thompson and Stewart (1986) 



over a very large number of intervals of tq they build up a shape of which that shown in 
Figure 14.30 is typical. 

Irrespective of any starting position or of the size and duration of any transients all long 
term, steady state, Poincare section points eventually settle to contribute to this pattern. It 
bears the signature of a chaotic attractor for high resolution displays a fine structure known 
as fractal. It is an example of the stretching and folding of an ensemble of steady state 
trajectories in phase space during which the trajectories become thoroughly mixed; that is, 
change from one set of close neighbours to another. The important point is that despite 
mixing, the trajectories retain their distinct identities and never merge; their time histories 
are invertible. 



A Brief Review 

We now review briefly the discussion so far in order to present a clearer picture of what we 
shall expect to identify in following sections. 

We saw on p. 474 how chaos could be approached via period doubling but that the 
symmetry of the population biology equation created an ambiguity on the route to chaos, so 
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Figure 14.28 Two phase trajectories from almost identical starting positions A and a for the 
oscillator x + O.Obx + x 3 = 7.5 cos t. After one period of the driving force the trajectories have 
diverged respectively to B and b. Reproduced by permission of John Wiley & Sons from Thompson and 
Stewart (1986) 



that no final point on a trajectory could be uniquely time reversed back to its origin. This 
essential time reversal arises from the continuity of unique solutions to the non-linear 
equations governing the system. The solution at a given time defines the complete state of 
the system and occupies a point in phase space so that, with time, the trajectory traces a 
line in phase space. However, trajectories with close origins in a chaotic attractor system 
diverge exponentially with time while the energy dissipation always associated with chaotic 
attractors requires the phase space volume to contract. To reconcile these contradictory 
features, phase space of at least three dimensions is required and the problem is resolved 
essentially through stretching and folding this phase space. The distortion of phase space 
on a torus surface is an example of this. 

To illustrate this process of stretching and folding, which we shall discuss later in more 
detail, we may consider two trajectories, originally close neighbours, which diverge as they 
spiral outwards on a plane (Figure 14.31) leaving the plane only to fold over by attraction 
and return back to the centre of the spiral. The divergence; that is, the sensitivity to initial 
conditions results from the stretching process and the folding comes from the attraction. 
The uniqueness of the trajectories in phase space ensures that they remain distinct, that they 
never merge, no matter how complex the phase space structure becomes. This complexity 
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Figure 14.29 A single phase trajectory traced over the first nine periods of the driving force in 
Figure 14.28. In three dimensional phase space the apparent crossing points would be separated by 
time resolution. Reproduced by permission of John WiLey & Sons from Thompson and Stewart (1986) 



is revealed by the fractal nature of the highly resolved Poincare section of the chaotic 
attractor in Figure 14.30. 

We now explain what is meant by fractal structure and discuss how theories of phase 
space distortion or mapping produce it. 

Fractals 

In topology a curve has a dimension of one and a surface a dimension of two. There are 
higher integral dimensions. The word ‘fractal’ was coined by Mandelbrot in 1975 to 
express the idea of a ‘shape’ with a non-integral dimension. He has since published books 
on the subject containing many beautiful computer generated patterns. The essential 
feature of all these fractal patterns is that they are self similar which means that, 
irrespective of scale, they retain the same geometric appearance. A well known example is 
the Koch snowflake. 



Koch Snowflake 

Figure 14.32 shows an equilateral triangle of side length 31. On the central section of each 
side is placed a similar triangle of side / and the process is repeated indefinitely to produce 
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Figure 14.30 Poincare section for an oscillator similar to that of Figures 14.28 and 14.29. High 
resolution displays a fractal fine structure. Reprinted with permission from 'Steady motions exhibited 
by Duffing's equation: A picture book of regular and chaotic motions', by Yoshisuke Ueda, published 
in New Approaches to Nontinear Probiems in Dynamics, pp. 311 — 322. Copyright 1980 by the Society 
for Industrial and Applied Mathematics, Philadelphia, Pennsylvania. All rights reserved 



a curve of infinite length (3/ x | x | . . .) but which encloses a finite area less than that of 
the circle surrounding the original triangle. 

Mandelbrot was first led to the idea of fractals by studying noise on a transmission line. 
He found that the pattern or the distribution of the noise remained the same whether taken 
over a period of an hour, a minute or a second; that is, self similarity prevailed. He identified 
the pattern as belonging to a Cantor set which dates from the nineteenth century and which 
G. D. Birkhoff had suggested in the 1920s might be significant in dynamical systems. 

Cantor Set 

The Cantor set (Figure 14.33) is constructed by removing the centre part l of a line of 
length 3/ and repeating the process indefinitely. We define the total set of points lying on 
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Figure 14.31 Trajectories around a chaotic attractor diverge yet remain within a bounded region. 
This is achieved by the stretching and foLding of phase space 



the line segment / to be some function /(/) and assume this total set to be preserved so that 

/(3/) = 2/(/) 

If then /(/) is considered to vary as some power 6 of 1 so that /(/) ~ l b we have 
/(3/) — 2/(0 giving (30 6 — 21 6 so that 3 6 = 2 and S = log 2/log 3 = 0.6309. This is the 
non-integral fractal dimension of the Cantor set. 






Figure 14.32 The Koch snowflake has a fractal non-integraL dimension. The final pattern has 
infinite Length but encloses a finite area less than that of the circle surrounding the original triangle 
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Figure 14.33 A Cantor set has a fractal non-integral dimension and is produced by removing the 
central third of a line and repeating the process indefinitely with the remaining segments. Poincare 
sections of chaotic attractors have a Cantor set-like structure 



(Problem 14.7) 

The importance of the Cantor set is that the highly resolved Poincare section of a chaotic 
attractor such as that on p. 493 reveals a Cantor set-like structure. It results from stretching 
the phase space and folding it closely into layers. It is the signature of a chaotic attractor 
and we now look at how this may be achieved. 

Smale Horseshoe 

The mathematical process which describes the stretching and folding of phase space is 
called mapping and a number of such maps have now been devised to produce this effect, 
e.g. the Smale horseshoe (Smale, 1963). 

In this example (Figure 14.34) a square is taken, stretched to double its length while its 
width is reduced to form a rectangle of area less than the square. The square may be taken 
as a cross section of a particular volume of phase space containing an ensemble or 
collection of trajectories the ends of which are shown as dots within the square. The 
reduction of area in the stretching process is equivalent to reducing the phase space by 
energy dissipation; at the same time it separates trajectories from their neighbours. The 
rectangle is then folded over into a horseshoe, the stretching and folding process is now 
repeated with the horseshoe again and again, so that successive cross sections reveal a 
Cantor set-like structure. The relative positions of the original trajectories are completely 
changed in this process. 
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(a) (b) 





Figure 14.34 The Smale horseshoe takes a square cross section of phase space containing an 
ensemble of trajectories (dotted ends), stretches the square to a rectangle of reduced area and folds 
the rectangle into a horseshoe. The process is repeated continuously with successive cross sections 
revealing a Cantor-set-like structure. The relative positions of the trajectories are changed in the 
process as the trajectories are mixed 



Chaos in Fluids 

Turbulence in fluids is the most widely observed of all chaotic motions. Fast flowing water 
from a tap or around a blunt obstacle loses its low speed coherence and flow symmetry. A 
satisfactory description of the behaviour is made more difficult because: 

• The theory of the liquid state is less well developed than that of gases and liquids. 

• Experimental methods have until recently used probes which disturb the state of the 
system being measured. 

The second of these difficulties has now been overcome by the development of laser- 
Doppler techniques combining the holographic system (p. 404) with the Doppler effect 
(p. 141). 

Typically, a laser beam of frequency ;/ 0 and wavelength Ao is split so that one half acts as 
a reference beam while the other is focused on a small fluid element (~ 0. 1 mm diameter) 
moving with a velocity u. This beam is scattered through an angle 0 with a frequency u s . 
The relationship between vq and u s is shown in Figure 14.35b. In Figure 14.35a the 
scattered beam joins the reference beam which is now modulated to give a component at 
the detector of the Doppler shift frequency vd — u s — vq. If ky and k s are the wave number 
vectors associated respectively with vq and u s then the component of the velocity u parallel 
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(b) 



v o^o r o 







u-(r 0 -r s ) n 
c 




Figure 14.35 (a) Scheme of the laser-Doppler technique for velocity measurements in a fluid, (b) 

The vector relationship between the scattered frequency v s , the incident laser frequency uq and the 
fluid velocity u; r is a unit vector, n the refractive index of the fluid and c the velocity of light. The 
Doppler shift frequency is v D = - u 0 



to the vector k = ko — k s depends only upon Ao, sin 0/2 and rv D . Velocities in the range of 
10~ 6 to 10 3 ms _1 are capable of being measured by this system. 

The frequency z/ D is so much greater than the frequencies associated with the fluid 
motion that the measured u(t) gives an instantaneous velocity value. Continuous records of 
u(t) over long periods may be Fourier analysed to show sharply defined frequency 
components when the flow is periodic with the appearance of broad band noise when the 
flow becomes chaotic. 

Chaos in fluids has been studied chiefly in two systems: 

1. Couette flow where the appropriate parameter is the dimensionless Reynolds number. 

2. Rayleigh-Benard convection where the parameter is the dimensionless Rayleigh 
number. This system is the model used by Lorenz in finding the original strange attractor. 

Couette Flow 

This flow was completely defined in the classic paper of G. I. Taylor (1923). In its simplest 
form it is produced in a fluid contained in the gap between two concentric cylinders with 
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radii differing by about a centimetre. One of the cylinders is fixed while the others rotates 
with an angular velocity lo although sometimes both cylinders may rotate with different 
angular velocities. The outer cylinder is usually glass, allowing observation of the flow. 
At low speeds of angular rotation the flow is symmetric in the azimuthal direction 
(Figure 14.36a). 

For flow in one dimension the relevant equation would read 



du x 

P u x~^~ 

OX 



— dp 
dx 



lid 2 u x 
dx 2 



where p is the fluid density, u x is the velocity in the x direction, p is the pressure and /j is 
the fluid viscosity. Each term in the equation has the dimensions of force per unit volume; 
the left hand side term may be considered as an inertial force and the last term may be seen 
as the viscous force. Flow symmetry depends on the relative strengths of these forces and 
the Reynolds number is written dimensionally as 

inertial force pu 2 L 2 uL 

Re = — = — = — 

viscous force L pu p 



where p — p/ p is the kinematic viscosity and u and L are a characteristic velocity and 
length of the system. 

For Couette flow 



p 



where r; is the radius of the inner cylinder and d is the width of the cylindrical gap. 

For slow speeds, that is low Re, any departure from symmetry is overcome by the viscous 
force restoring the system to equilibrium but as Re increases with increasing lo, the inertial 
effects of any departure from symmetric flow may be too great for the restoring viscous 
force and purely azimuthal Couette flow is lost. 

This loss of symmetry for high Re first shows itself as a series of vortices around each 
azimuthal flow line, so that fluid elements follow a spiral path in the azimuthal direction 
(Figure 14.36b). These vortices, called Taylor cells, are seen to arise as follows. 

An elemental toroid of the fluid initially at radius r\, circulating at angular velocity Lo n 
is displaced to radius ri. If its angular momentum is conserved we have 



2 / 2 
=u n r 2 

where u' r is its new angular velocity. Its centrifugal force will exceed that of the fluid 
originally at r 2 circulating with angular velocity uo ri if 

\oj' n \ > \Ur 2 \ 

1 2 I 1 2 

Hence an instability develops if |u; ri ri| > \oj r2 r 2 \~ for r 2 > r 1 ; that is, if 

d 1 2 

— \tor 2 \ < 0 
dr 
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(b) 



(c) 



Re increasing 



Figure 14.36 In Couette flow a Liquid is contained in the gap between two concentric cylinders one 
of which has an angular velocity uj with respect to the other. At low Reynolds number Re the flow is 
azimuthal as in (a). As Re increases flow symmetry is lost and vortices develop (b). A further increase 
of Re develops transverse waves along the lines of vortices (c) 



This is known as the Rayleigh criterion for the instability of Couette flow. 

When the inner layers of the fluid are moving more rapidly than the outer layers they 
tend to move outwards because the centrifugal force is greater than the pressure holding 
them in place. A whole layer cannot move out uniformly because the outer layers are in the 
way so it breaks into cells which circulate. 

The rotational motion of a fluid element in a Taylor cell appears as a periodic velocity 
variation in the z direction of Figure 14.36. Increasing Re that is the angular velocity of the 
cylinder, now causes harmonic oscillations of the vortices in the z direction as transverse 
waves travel around the azimuthal torus (Figure 14.36c). The frequency of these waves will 
be registered via the velocity measurements and as Re increases still more, other 
frequencies are generated and broad band noise begins to dominate with the appearance of 
chaos (Figure 14.37). 

Rayieigh-Benard Convection 

In this process heat provides the energy driving asymmetries in the flow. The incom- 
pressible fluid is contained between two horizontal plates about a centimetre apart, the 
lower of which is heated. For a small constant temperature difference between the plates 
the thermal conductivity and viscosity of the fluid ensure that the heat is conducted 
upwards in an orderly fashion (Figure 14.38a). When the temperature gradient is too steep 
the effect of these forces in maintaining equilibrium is overcome, flow symmetry is 
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Figure 14.37 The number of frequencies of the waves in Figure 14.36c increases with Re but broad 
band noise begins to dominate with the appearance of chaos in the bottom figure. Reproduced by 
permission of the American Institute of Physics from Swinney and Gollub (1978) 



lost and convective rolls in both clockwise and anti-clockwise directions can develop 
(Figure 14.38b). 

This occurs at some critical value of the Rayleigh parameter which we derive from the 
relevant equations. These are, in the positive z direction 




pgaAT 



In the last term of the first equation g is the acceleration due to gravity, a is the thermal 
expansion coefficient and AT is the constant temperature difference between the plates. 
This term is the buoyancy force which drives the warmer, less dense, liquid upwards. In the 
second equation K is the thermal diffusivity (p. 190) and equals k/ pC p where k is the 
thermal conductivity and C p is the specific heat at constant pressure. 
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(b) 



Figure 14.38 (a) at Low Rayleigh numbers Ra fluid in a Rayleigh -Benard cell conducts heat away 

from the base in a symmetric fashion. At some critical value Ra c flow symmetry is lost (b) and 
convective rolls develop in clockwise or anti-clockwise directions 



In the first equation the buoyancy force responsible for upward motion is opposed by the 
viscous term. If the strength of these forces is comparable, a low pressure gradient in the fluid 
will keep the inertial force on the left hand side low enough for the flow to remain symmetric. 
Comparable values of the buoyancy and viscous terms will give 



d 2 u z 



gaAT 



to yield some characteristic velocity 



gaATL 2 

u ~ 

where L, a characteristic length, is usually the depth of the liquid. 

The second equation determines the temperature distribution and the ratio 



(14.1) 



u z dT/dz UL 
Kd 2 T/dz 2 ~ ~K 



(14.2) 



tells us that for K large enough the thermal conductivity will distribute the heat rapidly 
enough for the symmetric conduction process to prevail. Combining (14.1) and (14.2) using 
the common factor U gives the Rayleigh number 

D gaATL 3 

Ra = 

/iK 



When the Rayleigh number is small enough, /; and K govern the conduction process. At 
some critical Rayleigh number Ra c convective fluid motion driven by AT replaces pure 
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Figure 14.39 The development of frequencies in the velocity flow spectrum at the criticaL Rayleigh 
number Ra c with the onset of noise as chaos sets in (bottom figure). Reproduced by permission of 
the American Institute of Physics from Swinney and Gollub (1978) 



heat conduction, instabilities develop and the flow becomes asymmetric. At the critical 
value Ra c convective rolls in the right or left handed direction begin to show, with a single 
frequency and its harmonics appearing in the velocity flow spectrum. Increasing Ra beyond 
Ra c introduces further frequency components which are followed by the onset of noise as 
chaos sets in (Figure 14.39). 

The Strange Attractor of Lorenz 

Lorenz (1963) used the Rayleigh-Benard process as the basis of his model of atmospheric 
convection in assessing the possibility of long range weather forecasting. The physical 
model is so restricted that it yields only the most rudimentary information about weather 
patterns, enough however to show that long range forecasting is not feasible because phase 
trajectories starting from almost identical positions diverge after a relatively short time. 

The two-dimensional convection rolls which appear in the rectangular cross section of 
Figure (14.38b) when Ra > Ra c can be described by two velocity components together 
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Figure 14.40 The first mode X(t) in the Lorenz equations gives a single convective roll, cLockwise 
forX positive, anti-clockwise forX negative. Warm rising fluid in this mode indicates where X and Y 
have the same sign. The ratio h/l determines the geometric factor b in the Lorenz equations 



with the deviation of the temperature from the linear conduction profile of low Ra. These 
three quantities, two of velocity and one of temperature, were expanded in two- 
dimensional Fourier series with terms (modes) of the form A ij(t) sin k,x sin kjz (p. 248) 
where the time dependence now appears in the amplitude coefficient. These expansions 
were used in the hydrodynamic equations of the last section to produce an infinite set of 
ordinary differential equations, but Lorenz reduced this number to three by considering 
only the first three modes of the Fourier expansion. 

The first mode X(t) determined by the velocity components gives a single convective roll 
filling the rectangular cell (Figure 14.40). The second mode 7(f) describes the temperature 
differences between ascending and descending currents in the convective roll and the third 
mode Z(f) represents the departure from linearity of the vertical temperature profile. 

Each mode is a phase space coordinate and the modes XYZ represent the physical state of 
the system at a given time. 

The Lorenz equations take the form 

X=a(Y -X) 

Y = rX — Y — XZ 
Z = XY — bZ 

where cr is the ratio of the fluid viscosity to its thermal conductivity, r is the ratio Ra/Ra c 
and Ms a geometric factor governed by the ratio h/l (height/length) of the cell in 
Figure 14.40. Lorenz took a = 10 (the approximate value for water) and h = 8/3. 

To show that the volume of phase space containing the trajectories decreased with time, 
Lorenz used a transport theorem of fluid dynamics relating the space rate of change of 
vectors describing a flow integrated over a volume V to the time rate of change of the same 
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volume. The vector in phase space may be written as F(X. Y. Z ) to give 



d 

df 



V(t) 



I divFdV 

J v 



Div F from p. 203 is given by 

dX dY dZ 
0X + dY + dZ 

with a value of —(a + b+ 1) = —13.67 in Lorenz’s equations, so dV(t)/dt is negative. 

This reduction in phase space volume indicates that the trajectories will eventually be 
confined to some limiting manifold. 

The overall behaviour of the system can be conveniently divided into various ranges of 
the value of r = Ra/Ra c . 

When 

X = Y = Z = 0 

there are three solutions to the Lorenz equations. These are 

(1) X=Y = Z = 0 

(2) X= Y = +[b(r- 1)] 1/2 :Z= (r - 1) 

( 3 ) x= Y = -[b(r- 1)] 1/2 :Z= (r- 1) 

When r < 1 solution (1) corresponds to a steady process of pure conduction with no 
convection, typical behaviour for small AT. Solutions (2) and (3) correspond to states of 
steady convection which exist only when r > 1 . 

If there is now a small perturbation from the condition X = Y = Z = 0 the behaviour of 
(1) remains stable as pure conduction for r < 1, trajectories moving to the origin X = 
Y = Z = 0 as a point attractor. As r increases beyond unity, steady convection will give 
way to the right and left handed convective rolls of solutions (2) and (3) which now 
correspond to separate stable attractors each with its own basin of attraction and set of 
spiralling trajectories. 

At 13.9 the separation between the basins of attraction is lost and trajectories move 
between (2) and (3) before settling on one or the other. At r k 24.7 (2) and (3) lose their 
stability as limit cycles and beyond this value of r the trajectories form two connecting 
bands, one centred on (2), the other on (3). (2) and (3) are now chaotic attractors with 
trajectories orbiting aperiodically around one before switching to the other. 



Problem 14.1 

If the period of a pendulum with large amplitude oscillations is given by 



T = 



To 1 + 
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where To is the period for small amplitude oscillations and do is the oscillation amplitude, show that 
for do not exceeding 30°, T and Tq differ by only 2% and for 6q = 90° the difference is 12%. 

Problem 14.2 

The equation of motion of a free undamped non-linear oscillator is given by 

mix = —f{x) 



Show that for an amplitude xq its period 



to 




dx 

. o [F(x 0 ) - F(x )] 1/2 ’ 



where F(xq) 



‘XQ 



fix) dx 

J o 



Problem 14.3 

The equation of motion of a forced undamped non-linear oscillator of unit mass is given by 

x = s(x) = F o cos cot 

Writing s(x) = s \X + s$x 2 , where sq and s 3 are constant, choose the variable uit = <j>, and for 
53 <C si assume a solution 



x = (a„ cos — <j> + b n sin —<j>\ 

n— 1 

to show that all the sine terms and the even numbered cosine terms are zero, leaving the fundamental 
frequency term and its third harmonic as the significant terms in the solution. 

Problem 14.4 

If the mutual interionic potential in a crystal is given by 




where r 0 is the equilibrium value of the ion separation r, show by expanding V about Vq that the ions 
have small harmonic oscillations at a frequency given by to 2 ~ 72 Vq / mr\, where m is the reduced 
mass. 

Problem 14.5 

The potential energy of an oscillator is given by 

V(x) = \ kx 2 — | ax 2 

where a is positive and <C k. 

Assume a solution x = A cos cot + B sin 2 ojt + x 1 to show that this is a good approximation at 
Wg = LO 2 = k/m if X\ = aA 2 /2 u)q and B = —aA 2 /6ujl, where a = a/m. 

Problem 14.6 

Prove that when A > 0.75 in Figure 14.11 then the slopes of/ 2 (x) at x* and x* are the same. 
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Problem 14.7 

Use the arguments in the paragraph on the Cantor set (p. 495) to show that the Koch snowflake has a 
fractal dimension of 1.2618. 



Recommended Further Reading 

Non-linear Dynamics and Chaos by Thompson, J. M. T. and Stewart, H. B., Wiley, New York (1986). 
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Non-linear Effects in Acoustic Waves 

The linearity of the longitudinal acoustic waves discussed in Chapter 6 required the 
assumption of a constant bulk modulus 

B dP 
dV/V 

If the amplitude of the sound wave is too large this assumption is no longer valid and the 
wave propagation assumes a new form. A given mass of gas undergoing an adiabatic 
change obeys the relation 



fv 0 y_ 


Vo 


V v ) 


_V 0 (1 + 6)_ 



in the notation of Chapter 6, so that 



8P dp 
dx dx 



- 7 Po(l + «) 




since 6 = dq/dx. 

Since (1 + 6)(1 + s) — 1, we may write 



dp 

dx 



—'yP o(l + 5 ) 7+1 



d 2 r] 
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Pressure 

(a) (b) (c) (d) 

Figure 15.1 The Local sound velocity in a high amplitude acoustic wave (a) is pressure and density 
dependent. The wave distorts with time (b) as the crest overtakes the lower density regions. The 
extreme situation of (c) is prevented by entropy-producing mechanisms and the wave stabilises to an 
N type shock-wave (d) with a sharp leading edge 




and from Newton’s second law we have 



dp d 2 r/ 

]h = ~ Po ~dt 2 



so that 



d 2 rj 

dt 2 



Cq(! + «) 



7+1 



d 2 ri 
dx 2’ 



where 




(15.1) 



Physically this implies that the local velocity of sound, co(l + s) 1 Vp2 , depends upon 
the condensation s, so that in a finite amplitude sound wave regions of higher density and 
pressure will have a greater sound velocity, and local disturbances in these parts of the 
wave will overtake those where the values of density pressure and temperature are lower. 

A single sine wave of high amplitude can be formed by a close fitting piston in a tube 
which is pushed forward rapidly and then returned to its original position. Figure 15.1a 
shows the original shape of such a wave and 15.1b shows the distortion which follows as it 
propagates down the tube. If the distortion continued the wave form would eventually 
appear as in Figure 15.1c, where analytical solutions for pressure, density and temperature 
would be multi valued, as in the case of the non-linear oscillator of Figure 14.3c. Before 
this situation is reached, however, the wave form stabilizes into that of Figure 15. Id, where 
at the vertical ‘shock front’ the rapid changes of particle density, velocity and temperature 
produce the dissipating processes of diffusion, viscosity and thermal conductivity. The 
velocity of this ‘shock front' is always greater than the velocity of sound in the gas into 
which it is moving, and across the ‘shock front’ there is always an increase in entropy. The 
competing effects of dissipation and non-linearity produce a stable front as long as the 
wave retains sufficient energy. The /V-type wave of Figure 15. Id occurs naturally in 
explosions (in spherical dimensions) where a blast is often followed by a rarefaction. 

The growth of a shock front may also be seen as an extension of the Doppler effect 
(p. 141), where the velocity of the moving source is now greater than that of the signal. In 
Figure 15.2a as an aircraft moves from S to S' in a time t the air around it is displaced and 
the disturbance moves away with the local velocity of sound vs- The circles show the 
positions at time t of the sound wave fronts generated at various points along the path of the 
aircraft but if the speed of the aircraft u is greater than the velocity of sound vs regions of 
high density and pressure will develop, notably at the edges of the aircraft structure and 
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Figure 15.2 (a) The circles are the wavefronts generated at points S along the path of the aircraft, 

velocity u > v s the velocity of sound. Wavefronts superpose on the surface of the Mach Cone (typicaL 
point P) of half angle a= sin _1 i;s /u to form a shock front, (b) At point P sound waves arrive 
simultaneously from positions A and B along the aircraft path when (u/v s) cos 8 =1. (8 + a = 90°) 

along the conical surface tangent to the successive wavefronts which are generated at a 
speed greater than sound and which build up to a high amplitude to form a shock. The 
cone, whose axis is the aircraft path, has half angle a where 

vs 

sin a = — 
u 

It is known as the ‘Mach Cone’ and when it reaches the ground a ‘supersonic bang’ is heard. 

The growth of the shock at the surface of the cone may be seen by considering the sound 
waves in Figure 15.2(b) generated at points A (time t ,\) and B (time t| S ) along the path of 
the aircraft, which travels the distance AB = x — uAt in the time interval At = ts — t\. 
The sound waves from A will travel the distance ro to reach the point P at a time 



to — *a H 

Vs 



Those from B will travel the distance r\ to P to arrive at a time 



r i 

t i — f b -pi — 
Vs 
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If x is small relative to r o and r \ , we see that 

r\ — r o « x cos 6 = uAtcosO 



so the time interval 



t i — to = tB ~ tx + 



(''l - r o) 



Vs 



= Ar- 



if At cos 0 
Vs 



= At 1 - 



u cos 0 
Vs 



For the aircraft speed n < t>s, t\ — to is always positive and the sound waves arrive at P in 
the order in which they were generated. 

For u > v s this time sequence depends on 6 and when 

u 

— cos 0 = 1 
vs 



1 1 = to and the sound waves arrive simultaneously at P to build up a shock. 
Now the angles 0 and a are complementary so the condition 



cos 0 = 



vs 



u 



defines 



vs 

sin a — — 



u 



so that all points P lie on the surface of the Mach Cone. 

A similar situation may arise when a charged particle q emitting electromagnetic 
waves moves in a medium of refractive index greater than unity with a velocity v q which 
may be greater than that of the phase velocity v of the electromagnetic waves in the 
medium (v < c). A Mach Cone for electromagnetic waves is formed with a half angle a where 

v 

sin a = — 
v q 

And the resulting ‘shock wave’ is called Cerenkov radiation. Measuring the effective 
direction of propagation of the Cerenkov radiation is one way of finding the velocity of the 
charged particle. 



Shock Front Thickness 

The extent of the region over which the gas properties change, the shock front thickness, 
may be only a few mean free paths in a monatomic gas because only a few collisions 
between atoms are necessary to exchange the energy required to raise them from the 
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equilibrium conditions ahead of the shock to those behind it. In a polyatomic gas the 
collisions are effective in producing a rapid increase in translational and rotational 
mode energies, but vibrational modes take much longer to reach their new equilibrium, so 
that the shock front thickness is very much greater. 

Within the shock front thickness the state of the gas is not easily found, but the state of 
the gas on one side of the shock may be calculated from the state of the gas on the other 
side by means of the conservation equations of mass, momentum and energy. 

Equations of Conservation 

In a laboratory, shock waves are produced in a tube which is divided by a diaphragm into a 
short high-pressure section and a much longer low-pressure section. When the diaphragm 
bursts the expanding high pressure gas behaves as a very fast low-inertia piston which 
compresses the low pressure gas on the other side of the diaphragm and drives a shock 
wave down the tube. The profile of this shock wave is the step function shown as the dotted 
line in Figure 15.3, and the gas into which the shock is propagating is considered to be 
at rest. This simplifies the analysis, for we can consider the situation in Figure 15.3 as it 
appears to an observer O travelling with the shock front velocity u \ into the stationary gas. 
The shock front is located within the region bounded by the surfaces A and B of unit area, 
each of which remains fixed with respect to the observer. The stationary gas which moves 
through the shock front from surface B acquires a flow velocity u < u \ and a velocity 
relative to the shock front of mi = — u. From the observer's viewpoint the quantity of 

gas flowing into unit area of the region AB per unit time is p\U\, where p \ is the density of 



^ Shock front velocity u 1 



ol 



q * Observer on shock front 



Shocked gas 


1 


Stationary gas 


Density p 2 


1 


Density p, 


Pressure P 2 


1 


Pressure P t 


Flow velocity u 


1 


Relative velocity 


Relative velocity 


1 


u i ■*— 


u 2 = u,-u 


1 





Unshocked 
gas at rest 



A 



B 



Figure 15.3 The pressure 'step proflLe' of a shock wave developed in a shock tube is shown by the 
dotted line. The plane cross-sections at A and B remain fixed with respect to the observer 0 moving 
with the shock front at velocity u i into unshocked gas at rest of pressure p\ and density p\. The 
shocked gas has a pressure p 2 , a density p 2 and a velocity u, with a relative velocity u 2 = u i — u 
with respect to the shock front. The states of the gas at A and B are related by the conservation 
equations of mass, momentum and energy across the shock front. Experimental measurement of the 
shock velocity U\ is sufficient to determine the unknown parameters if the stationary gas parameters 
are known 
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the gas ahead of the shock. The quantity leaving unit area of AB per unit time is 
P2{u\ — u) = P2U2, where pi is the density of the shocked gas. 

Conservation of mass yields p\u \ — pi«2 = m (a constant mass). The force per unit area 
acting across the region AB is P2 — p 1, which equals the rate of change of momentum of 
the gas within the unit element, which is m(u \ — u 2)- The conservation of momentum is 
therefore given by 



Pl + pyu\ =P2 + PlU 2 2 - 



The work done on unit area of the region per unit time is p \u \ — P2U2, and this equals the 
rate of increase of the kinetic and internal energy of the gas passing through unit area of the 
shock wave. 

The difference 



Pi P 2 

P\U\ — p 2 « 1 = m m 

Pl P 2 

so that if the internal energy per unit mass of the gas is written e( p, p), then the equation of 
conservation of energy per unit mass becomes 

1 1 pi 1 1 p 2 

-u\ + e 1 H — - u 2 +e 2 H 

2 Pi 2 p 2 

where for an ideal gas p/ p — RT and e = c v T = (I/7 — 1 )p/p, where T is the absolute 
temperature, c v is the specific heat per gram at constant volume and 7 = c p /c v , where c p is 
the specific heat per gram at constant pressure. 

These three conservation equations 



p \ U \ — P2W2 = ni (mass) 

P \ + P\u\ = P2 + P2K1 (momentum) 



and 

1 1 Pl 1 1 Pl , X 

x«i+«H = -u^ + e 2 +— (energy) 

z Pi Z p 2 

together with the internal energy relation e(p, p) completely define the properties of an ideal 
gas behind a shock wave in terms of the stationary gas ahead of it. 

In an experiment the properties of the gas ahead of the shock are usually known, leaving 
five unknowns in the four equations, which are the shock front velocity u 1, the density of 
the shocked gas p 2, the relative flow velocity behind the shock u 2 , the shocked gas pressure 
p 2 and its internal energy <?2- In practice the shock front velocity u\ is measured and the 
other four properties may then be calculated. 

Mach Number 

A significant parameter in shock wave theory is the Mach number. It is a local parameter 
defined as the ratio of the flow velocity to the local velocity of sound. The Mach number of 




Ratios of Gas Properties Across a Shock Front 



511 



the shock front is therefore M s — ui/c\, where u \ is the velocity of the shock front 
propagating into a gas whose velocity of sound is c\. 

The Mach number of the gas flow behind the shock front is defined as Mf = u/c 2 , where 
u is the flow velocity of the gas behind the shock front (u < u \ ) and c 2 is the local velocity 
of sound behind the shock front. There is always an increase of temperature across the 
shock front, so that C 2 > c\ and M s > Mf. The physical significance of the Mach number 
is seen by writing M 2 = u 2 /c 2 , which indicates the ratio of the kinetic flow energy, 
\u 2 mol -1 , to the thermal energy, c 2 = 7 RT mol -1 . The higher the proportion of the total 
gas energy to be found as kinetic energy of flow the greater is the Mach number. 



Ratios of Gas Properties Across a Shock Front 

A shock wave may be defined in terms of the shock Mach number M s , the density or 
compression ratio across the shock front (3 = p 2 / pi, the temperature ratio across the shock 
T 2 /Ti and the compression ratio or shock strength y = pijpi- 
Given the shock strength, y = p 2 / Pi, the conservation equations are easily solved to 
yield 



where 



and 




h + a \ 

\\ + a) 



1/2 



(i = — = a + y 

Pi 1 + ay 



Tt-J 1 + oqA 

T 1 ' V V a + y ) 



Alternatively these may be written in terms of the experimentally measured parameter M s as 

— — y — M 2 ( 1 + a) - a 




Pi 1 — a + aM^ 



T 2 _ [a(M; - 1 ) +M;][a{M 2 - 1 ) + 1 ] 
Ti ~ Ml 



For weak shocks (where p 2 /pi is just greater than 1 ) (3, T 2 /T 1 and M s are also just greater 
than unity, and the shock wave moves with the speed of sound. 
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Strong Shocks 

The ratio P2/P1 1 defines a strong shock, in which case 



M: 



(7+1) , 

27 



and 



0 



P2^ h + 1 \ 
P 1 \7 — 1/ 



a limit of 6 for air and 4 for a monatomic gas for a constant 7. The flow velocity 



U — U I — ll 2 



2 u 1 

(7+I) 



and the temperature ration 



Ti 

Ti 




( 7 + 1 ) 



The temperature increase across strong shocks is of great experimental interest. The 
physical reason for this increase may be seen by rewriting the equation of energy 
conservation as \u\ + h 1 = \u) T li 2, where h = (e + p/p) is the total heat energy or 
enthalpy per unit mass. For strong shocks /?2 ^ h 1 of the cold stationary gas and u \ 'JP u 2, 
so that the energy equation reduces to /12 ~ jM 2 , which states that the relative kinetic 
energy of a stationary gas element just ahead of the shock front is converted into thermal 
energy when the shock wave moves over that element. The energy of the gas which has 
been subjected to a very strong shock wave is almost equally divided between its kinetic 
energy and its thermal or internal energy. This may be shown by considering the initial 
values of the internal energy e 1 and pressure p \ of the cold stationary gas to be negligible 
quantities in the conservation equations, giving the kinetic energy per unit mass behind the 
shock as 



5H 2 = 5 (mi - M2) 2 = e2 

the internal energy per unit mass of the shocked gas. 

In principle, the temperature behind very strong shock waves should reach millions of 
degrees. In practice, real gas effects prevent this. In a monatomic gas high translational 
energies increase the temperature until ionization occurs and this process then absorbs 
energy which otherwise would increase the temperature still further. In a polyatomic gas 
the total energy is divided amongst the various modes (translational, rotational and 
vibrational) and the temperatures reached are much lower than in the case of the 
monatomic gas. The reduction of 7 due to these processes is significant, since with 
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increasing ionization 7 — > 1 , and the temperature ratio depends upon the factor 
(7 — 1 ) / (7 + 1 ) which becomes very small. 

(Problems 15.1, 15.2, 15.3, 15.4, 15.5, 15.6) 

Solitons 

We have seen that a pulse, limited in space, is also limited in time. Fourier analysis shows 
that a pulse is the superposition of a large number of components with different frequencies 
and that the high frequency components contribute to the vertical edges of the pulse Fig- 
ure 10.3. The superposition of these components changes as phase differences develop; 
different frequencies will have different phase velocities and the pulse disperses. 

It is surprising, therefore, that high amplitude solitary waves or solitons are known to 
exist. The first recorded observation of a soliton is that of Scott-Russel (1844) who saw a 
single wave about 40 cm high travelling along a canal in Scotland. Rayleigh (1876) 
developed an expression for the shape of this soliton based on the hydrodynamics of waves 
in shallow water. 

That expression, the bell-shaped Figure 15.4 is given by 

rj — a sech 2 a(x — xq) 



where 



_ 1 I 3a 
2 y h 2 (h + a) 




Figure 15.4 The soLitary wave (soLiton) on a shallow canal seen by Scott-Russel (1844) was 
described as a sech 2 bell-shaped function by Rayleigh (1876). The canal depth is h, the soliton 
amplitude is a and 77 measures a displacement on the soliton curve. The soliton is centred at x 0 and a 
is a function of a and h 
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r], a, h and *0 are all shown in Figure 15.4. The coordinate xq about which the static figure 
is centred is replaced by ct when the soliton is moving; c is the soliton velocity and t is the 
time. We shall see that c is related to the height of the soliton. Larger amplitude solitons 
move faster. 

Further sightings of solitons on Dutch canals led to a thorough discussion of waves with 
finite amplitude in shallow water by Korteweg and de Vries (1895). Their equation 
describing soliton behaviour is known as the KdV equation and is now taken as the basis of 
soliton theory. We shall not pursue the relevant fluid dynamics necessary to obtain the KdV 
equation but we shall obtain its mathematical form by a method which may lack formal 
rigour but which provides a good working model. It also emphasizes the physical 
characteristics which produce a soliton. 

The underlying physics of solitons is the competition between two processes. One of 
these causes a high amplitude or non-linear wave to break; we have seen this in the 
formation of a shock wave in Figure 15.1c. This results from the increased phase velocities 
of the high amplitude non-linear components of the wave. 

In a soliton this is opposed by the dispersion of the wave components in such a way that 
a stable profile is maintained. 

We shall derive the form of the KdV equation and then discuss the following topics: 

• Solitons, Schrodinger’s equation and elementary particles. 

• Solitons in optical fibres. Telecommunications.. 

A list of references is given at the end of the chapter. 



Non-Linearity 

Equation (15.1) shows that the higher amplitude components of an acoustic wave 
propagate with a phase velocity 

U= § = Co(1+5) 7+V2 



where co is the phase velocity of a small amplitude linear wave and ,v, the condensation, is a 
measure of the compression in the wave. 

We may expand this, to a first order, to give 



dx 

V= fr = C « 




(15.2) 



In a linear, low-amplitude, right-going wave we have 



V = 



Nin ^ 



i(u )t—kx) 
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So, denoting dq/dt as q, and dq/dx as 77 x we have 



Vt/Vx = 



—ii! 

~k 



= ~c 0 



or 



fit + Co? h = 0 



(15.3) 



Throughout this chapter we shall indicate partial differentiation with respect to a variable 
by writing that variable as a subscript. Thus, q t — dii/dt\ rj x — dq/dx\ q n = d 2 q/dt 2 and 
Vxx = d 2 q/dx 2 . Replacing co in equation (15.3) by v in equation (15.2) gives 



V t + c 0 





Vx = 0 



which, because i = kq is in phase with q, (Figure 6.2), becomes 



Vt + co 



7 + 1 



kq 



fix — q t + coq x + c 0 



7 + 1 



kqq.x = 0 



(15.4) 



We are interested in non-linear effects and after removing the linear contribution of 
equation (15.4) we are left with the non-linear expression 

q t + bqq x = 0 (15.5) 



where 



b = 



co 




k 



Equation (15.5) provides the first two terms of the KdV equation. We now consider the 
third, the dispersion term, which competes with the non-linear bqq x term. 



Dispersion and the Form of the KdV Equation A typical dispersion equation is that for 
transverse and longitudinal waves in a periodic structure given by equation (5.12) as 



v = 



U) 

k 



co 



sin kci/2\ 

ka/2 J 



where k is the wave number and a is the particle separation. For small k, long A, we may 
expand the sine term to give 



v — 



U) 

k 



co ka 
kajl Y 




3 

••• + 
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or 



where 



u; = cok 1 




cob — dk 



3 



d = coa 2 /4 



Writing a linear wave in the form 



(15.5a) 



gives 



V = r)„, e 



i(u Jt—kx) 



T]t = i UJT], rj x = -ikri and r] xxx = ik 3 r] 



which, with equation (15.5a), gives 

Vt + c 0 r]x + dr) xxx = 0 



Again, the contribution r/ t + cor/ A - applies only to linear waves and replacing this for non- 
linear waves by equation 15.5 

Vt + bvVx 

gives 

Vt + hWx + dVxxx = 0 (15.6) 

where b and d are constant coefficients. This is the form of the KdV equation which 
describes soliton behaviour. The coefficients b and d depend upon the particular soliton 
under discussion. 

We gain an insight into the effect of the dispersion term by considering the following. 
Let us write a right-going linear wave in the form 



„ — n p'M-fa) _ n p i k(c 0 t-x) 

'I — i/m c — 1 lm u 



where 



to = cok 



The effect of dispersion, from the previous section, changes uj = cok to 



so we have 



uj = cok 1 




2 




1 - 




— X 



V = V m exp 
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and dispersion has the effect of shifting the wave. Note that in this case of normal 
dispersion the shift retards the higher k, shorter wavelength terms. 

Mathematically, this dispersive shift is used to offset the steepening, wave breaking 
effects of non-linearity. The technique, known as a Gardner-Morikawa transformation, is 
to choose a coordinate system which moves with the velocity c o, the pulse rides on this 
moving coordinate so that dispersion relative to c o is much reduced. In addition, because 
any dispersive change is now so much slower, a much longer time scale t> t is chosen and 
the final aim is to show that changes in the soliton profile are negligible in the r time scale. 

The Elements of the KdV Equation Although we derived the form of the KdV equation 
using the amplitude 77 , the equation is most often written in terms of a quantity a which 
may represent any property of the wave which varies with distance and time. 

In their paper ‘The Discovery of the Soliton’ (1965) Zabusky and Kruskal used the 
equation in the form 



- 0 



(15.7) 



where 8 1 . 

Their experiment was made by computer simulation. In the absence of the third 
dispersive term the non-linear equation 



Uf T uu x — 0 (15.8) 

describes the development of the shock wave of Figure 15.1. The positive pulses of Figure 
15.1a, b and c are superposed in Figure 15.5 with u plotted against x. It is evident that u, 
increases with higher values of u and equation (15.8) retains a single valued solution only 
as long as the gradient u x of the leading edge becomes increasingly negative as the pulse 
steepens. 

Now equation (15.8) is satisfied by any function u —f(x — ut ) — see Problem 15.7 — and 

u x = {\-u x t)f (15.9) 




Figure 15.5 Figs. 15.1 (a), (b) and (c) superimposed to show breaking of a non-linear wave 
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A A' 




Figure 15.6 The velocity of a soLiton increases with its magnitude and solitons are transparent in 
mutual collisions, each retaining its own identity. A large soliton A overtakes a smaller soliton B to 
emerge as A' with B unaffected 



where 

/' = df/d(x - ut) 

Taking the pulse profile at t = 0 as it = fix) = cos nx equation (15.9) shows that 
u x — —oo at u k. 0 (the foot of the pulse) when x = 0.5 and t — l/ir. At this point the wave 
becomes infinitely steep and breaks. This behaviour was observed by Zabusky and Kruskal. 
When Zabusky and Kruskal added the third dispersion term in their computer experiment 
to give the KdV equation 



Ilf T till x T b 1 1 v v.v — 0 

they found that after a time t = \/n the solution broke into a train of solitary waves 
(solitons) of successively larger amplitudes with the larger waves travelling faster than the 
smaller ones. Even more important from the point of view of optical solitons, after one 
soliton had overtaken another, each soliton retained its unique identity (Figure 15.6). 
Solitons are transparent to each other and are unaffected by mutual collisions. 

(Problems 15.7, 15.8) 

Two Important Forms of the KdV Equation 



1. The KdV equation for shallow water waves may be written in the form 

u t + 6 uu x + u xxx — 0 



with a solution 



i(x, t) — 2ct 2 sech 2 a(x — ct) 

= 2^2 log [1 +e 2 “(*- rt )] 



(15.10) 



u(x,t) = 2|^log[l+e- 2 “^)] 



or 
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Figure 15.7 The KdV equation u t + 6uu x + u xxx = 0 has a soliton solution u(x,t) = 
2a 2 sech 2 a(x — ct) with a maximum value of 2a 2 



Note that the exponents in the log solutions may be positive or negative. 

The sech 2 form of the solution may be seen to fit equation (15.10) with a soliton 
velocity c = 4a 2 (twice the maximum value of u) by showing that 

u t = 2 auc tanh <j>, where <f> — a(x — ct) 
uu x — —2cm 1 tanh r/> 



and 



u.xxx — —8 a 2 u tanh cj) + 12 au 2 tanh (f> 

The sech 2 shape of the soliton is shown in Figure 15.7. Its peak value is 

u = 2a 2 



(Problems 15.9, 15.10) 

2. The second important form of the KdV equation is 

Uf — 6uu x + u xxx — 0 (15.11) 

(the shallow water wave form with a negative second term). This has a time 
independent soliton solution of 

u(x) — —2a 2 sech 2 (x — xo) 

where xo locates the centre of the soliton. This solution may be shown to satisfy 
equation (15.11) by calculating u x and u xxx as for equation (15.10). 

A graph of this soliton. Figure 15.8, shows its minimum to have a value of —2a 2 . Its 
importance is its connection with Schrodinger’s equation, which we now discuss. 
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Figure 15.8 The KdV equation u t — 6uu x + u xxx = 0 has a time independent solution u(x) = 
— 2a 2 sech 2 a(x - x 0 ) with a minimum value of — 2a 2 . This equation is related via Miura's 
transformation to Schrodinger's equation which has an eigenvalue of A = —a 2 



(Problem 15.11) 

Solitons, Schrodinger's Equation and Elementary Particles 

In 1968, Miura found a remarkable connection between equation (15.11) and the equation 

v t + 6v 2 v + v xxx — 0 (15.12) 

which itself has a soliton solution. 

Miura showed that if v 2 + v x = u then 

(v, - 6v 2 v x + v xxx ) = u, - 6 uu x + u X xx (15.13) 




(Problem 15.12) 

So if v satisfies equation (15.12) with the sign of its second term changed, then u satisfies 
equation (15.11). Now Miura's transformation with 



yields 



v 2 + v x = u(x) and v = V'jc/ V 7 
ipxx ~ u{x) ip = 0 



(15.14) 



(Problem 15.13) 

If u(x) is now transformed to u(x) — A, where A is a constant, then equation (15.14) 
becomes Schrodinger’s equation 

ipxx + (A - u(x))ip = 0 



with A as an eigenvalue. 
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So Miura’s transformation has related the KdV equation 

u t — 6 uu x + u xxx = 0 (15.11) 

to Schrodinger’s equation 

V'x.v+ (A-n(x))V’ = 0 (15.15) 

Using the soliton solution 

u = —2 or sectr a(x — xo) 

of equation (15.11) we can show that the wave function 

ip — Asecha(x — x'o), where A is a constant (15.16) 

satishes equation (15.15) when the eigenvalue A = — a 2 which is half the value of the 
minimum of the soliton with which it is associated (Figure 15.8) (See Gardner et al., 1967). 

(Problems 15.14, 15.15, 15.16) 

Since A is negative this represents a bound state in wave mechanics. 

Other values of A > 0 may be associated with solitons but these are not bound states and 
are related to progressive waves. 

The fact that solitons may be associated with Schrodinger’s equation and retain their 
unique identities in mutual collisions has led physicists to postulate that solitons may 
appear as massive elementary particles much heavier than the proton. 

Solitons may enter particle physics in another way, confined not only in space but in 
time. In this case they are called instantons. Instantons have already been used to explain a 
pattern of particle masses which had posed a long-standing puzzle. 

There are four ways of making quark-antiquark mesons from light quarks. Three of 
these mesons have been known for many years: the negative, positive and neutral pi 
mesons (pions) with masses equivalent to about 140 MeV (an electron equivalent mass is 
- 0.5 MeV). 

The fourth meson has never been found but the eta meson has all the required properties 
except its mass which is about 550 MeV. Instantons explain this mass anomaly — they 
appear as energy excitations, located in space, in the field which binds the quarks together. 
They change the mass distribution among the mesons because they affect the various quark 
combinations in different ways (see Rebbi, 1979). 



Optical Solitons 

At the time of this writing the most practical use of solitons is in telecommunications. 
Optical fibres act as wave guides to microwaves and higher frequency electromagnetic 
waves and optical solitons are able to carry information along single mode silica fibres at 
multigigabit rates for distances greater than 9000 km, the width of the Pacific Ocean, with 
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a bit error rate (BER) < 1 0 9 , the international standard. Modern fibres have a very low 
loss rate of < 1 dB km" 1 and an effective area of ~ 30 pm 2 . The electrical power involved 
is very low and a total optical system is feasible including the amplifiers spaced along the 
cable. This permits a simpler, faster and more easily maintained system than that using 
conventional electronics. Research on optical solitons is world-wide but, for the English 
reader, the work of Linn Mollenauer and his colleagues at the A. T. & T. Bell Labs, New 
Jersey is the most accessible (see references). 

Optical solitons have the normal sech 2 intensity profile and their amplitudes are given by 
sech wave function solutions to a non-linear Schrodinger equation (see Appendix, p. 555). 

As with all solitons, optical solitons are produced by a balance between the competing 
effects of dispersion and non-linearity but the non-linearity of optical fibres is a very 
special case which contributes in a remarkable way to the maintenance of the soliton 
profile. 

The Kerr Optical Effect and Self-phase Modulation In some materials, including silica 
fibres, the index of refraction for light of a given wavelength varies with the intensity of the 
light. This is the Kerr optical effect, which is expressed by 

n — no = 112 I 

where n is the index of refraction for a light wave of intensity / (large enough for non- 
linearity), n 0 is the refractive index for a low amplitude wave of the same frequency and 111 
is a constant equal to 3.2 x 10 _16 cm 2 W . The value of n 2 is small but the area of a 

single mode optical fibre ~ 10 " 6 cm 2 , so we must think in terms of megawatts per square 
metre. Moreover, the effects of non-linearity build up over fibre distances of many 
kilometres. 

Since n 2 l is positive we have 

(X 1 \ 

n — no — c[ >0 

\V VQ J 

so the phase velocity v of a high amplitude wave is less than vo, the phase velocity of a low 
amplitude linear wave of the same wavelength. 

At a given wavelength this creates a phase retardation between the two amplitudes of 

2tt 

Acj> = —Ln 2 1 

over a length L of the fibre. This phase retardation is obviously greater for the short 
wavelength high frequency components of the pulse. Figure 15.9, than for the lower 
frequencies and so in the high intensity central section of the pulse the higher frequencies 
are shifted towards the tail of the pulse while the lower frequencies advance to the front. 

This process is opposed by the dispersive properties of the fibre because at the 
wavelength at which the solitons are centred; that is, A ~ 1.5 pm (1500 nm) the dispersion 
is negative (anomalous) so that dv g /d\ < 0, where v g is the group velocity. 

Negative dispersion advances the trailing higher frequencies and retards the lower 
frequencies, both in a direction towards the centre of the pulse, so the pulse sharpens 
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Figure 15.9 In the Kerr optical effect the velocity of light at a given wavelength depends upon its 
intensity. The high frequencies in the high intensity region of a soliton travelling in an optical fibre 
suffer a phase retardation; the low frequencies are advanced 



towards a soliton sech 2 shape. Figure 15.10, and in a loss-free perfect silica fibre the soliton 
would maintain this shape indefinitely. In practice, the wavelength A ~ 1.5 pm is close to 
the minimum of the loss versus wavelength behaviour of the fibre, which accounts for low 
loss fibres of < 1 dB km 1 . Optical amplifiers, which we shall discuss shortly, maintain the 
shape of the soliton over very long distances but even without amplification a soliton can 
travel several hundred kilometres along the fibre without changing its amplitude or shape. 

This distance is called the soliton period, Figure 15.11, and is given by 



Zo = 0.322 



9 9 9 

7T"CT" T z 

A or 039 D 



at A r -o 1.55 pm 



where c is the velocity of light in free space, A vac is the wavelength in free space, r is the 
full width at half the maximum value of the soliton and D is the group velocity dispersion 
parameter of the fibre; that is, the change in pulse delay with change in wavelength per unit 
of fibre length. 

The units of r are picoseconds and experimental solitons are produced in the range 1 - 
50 ps. The units of D are picoseconds per nanometre per kilometre and experimental values 
of D are ~ lOpsnm -1 km -1 . At D ~ 1 psnm" 1 km -1 a 50 ps pulse has a soliton period 
Zo ~ 930 km. 




Figure 15.10 The effects of Figure 15.9 are reversed by the negative (anomalous) dispersion of the 
optical fibre at the wavelength on which the soliton is centred. This sharpens the soliton pulse 
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Figure 15.11 A soliton can travel several hundred kilometres in an optical fibre without being 
degraded in any way. This distance z 0 , is called the soliton period 



Experimental Aspects Experimentally, the solitons are produced by a mode locked laser 
with an additional fibre arm in the feedback loop. As the laser builds up from noise the 
initially broad pulses are considerably narrowed by passing through the fibre arm and then 
reinjected back into the laser cavity, forcing the laser itself to produce narrower pulses. 
This process is repeated until the pulses become solitons and are ready for injection, via 
coupling, into the transmission system. The laboratory cable is a fibre spool ~ 75 km long 
and the solitons are recirculated through this loop to travel distances > 10 000 km if 
required. 

A typical laser soliton source produces pulses of ~ 50 ps with a power ~ 0.5 mW at a 
repetition rate of 2.5 GHz. 

The Raman Effect This plays a very important role in optical soliton transmission. It 
arises when molecules in a material absorb radiation and it involves the vibrational and 
sometimes the rotational energy levels of the molecules. Figure 15.12 shows the vibrational 
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Figure 15.12 The Raman effect can degrade a soliton by transferring energy from its higher 
frequency to its lower frequency components. Vibrational energy levels in the optical fibre absorb 
higher frequency radiation v from the soliton which reabsorbs it at a lower frequency n — An (Stokes 
line). There are three possible processes. In Rayleigh scattering a photon returns to its original 
vibrational energy level, the Raman effect provides a frequency change An = ±1, where An is the 
frequency interval between vibrational energy levels 
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RGP — Raman Gain Pump 

A-1470 nm RGP 




L X L X L 



Figure 15.13 The transmission Line acts as its own distributed ampLifier when solitons accept 
higher energy photons via the Raman effect from optical pumps located at short intervals (distances 
•C z 0 , the soliton period). Excessive noise production is avoided by frequent low gain amplification 
(see Mollenauer et ai, 1986) 



energy levels in a molecule with 0 as the ground state. Suppose initially that the molecule is 
in the energy level 1 and absorbs a photon of frequency v which raises it to an excited level 
which may not be a stationary state. If the photon drops back to its original level the re- 
radiated photon of frequency v is called Rayleigh scattering. However, selection rules also 
allow vibrational level changes Av — ±1, where Av is the vibrational energy level 
interval, so the photon may drop back into level 2 or 0. The re-radiated or scattered photon 
will then appear at the frequencies v — Av (Stokes line) or v + Av (anti-Stokes line). 

The Raman effect can ‘degrade' a single soliton via a process known as the ‘self- 
frequency shift’. Here the vibrational levels of the silica fibre molecules absorb energy 
from the higher frequencies in the soliton pulse and the scattered radiation acts as a Raman 
pump for the lower frequencies in the pulse because the fibre provides a Raman acceptance 
band over a broad frequency spectrum. 

Indeed, although a power of 0.5 mW provides a stable single soliton, early experiments 
showed that solitons with powers >1 W suffered from ‘self-frequency shift' to such an 
extent that the soliton initially narrowed but then formed smaller satellite solitons. 

The Raman Effect and Optical Amplification Solitons can gain energy via the Raman 
effect as well as lose it and this is the basis of amplification along an optical transmission 
line. One method results in the line acting as its own distributed amplifier. Laser pumps 
coupled into the line at regular intervals maintain the shape of a soliton by feeding in a 
frequency higher than that of the soliton, the energy difference being very close to the 
broad peak of the Raman gain band of the silica fibre. In Figure 15.13 the soliton 
wavelength is A = 1.5 pm and the lasers pump energy at A ~ 1.4 pm. The pumps can also 
inject radiation in the counter-propagating direction, which helps to average out any effect 
of pump fluctuations; the penetration of the amplifying beam along the fibre is also 
enhanced. The intervals between the laser pumps are ~ 30 km which is a small fraction of 
the soliton period zo several hundred kilometres). In this way, the gain per interval is 
kept low enough to avoid excessive amplification of noise. 

A second method. Figure 15.14 uses lumped amplifiers in the form of short lengths 
~3m of optically pumped fibres doped with a rare earth such as Erbium. Again, the 
interval between these lumped amplifiers is <C Zo the soliton period to keep the noise 
amplification low. The lumped amplifiers are energized by laser diode chips and for an 
input of ~ 10 mW a gain of 30-40 dB is obtained at the useful wavelengths. The power of 
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A — Erbium-doped amplifying fiber 
OP — optical pump (A= 1480 nm) 

C — coils of transmission line 



3 _ 1 Rio nm 




25 km 



25 km 



c c 



OP 25 km 

c 



Figure 15.14 Solitons are now maintained by Lumped amplifiers in the form of « 3 m lengths of 
optically pumped fibres doped with the rare earth Erbium separating 25 km lengths of transmission 
line. The interval between the Low gain amplifiers <C z 0 (the soliton period) to avoid noise 
amplification 



these amplifiers is useful in multiplexing, the subject of the next section (see Desurvire, 
1992). 

Multiplexing This refers to the possibility of sending more than one channel of 
information down a single fibre. In current transmission systems, non-linear interaction 
causes severe interchannel interference but solitons are transparent to each other. They are 
unaffected by collisions and do not interfere with each other. 

In multiplexing, two channels along a single fibre are provided by solitons which are 
polarized in planes perpendicular to each other. 

Even more channels are possible with wavelength division multiplexing. Solitons of 
different wavelengths have different velocities and analysis shows that in a system using a 
chain of lumped amplifiers, adjacent WDM (wavelength division multiplexed) solitons 
interact just as in a lossless fibre so long as the collision length (twice the length of a 
soliton) is two or three times the amplifier spacing (Figure 15.15). 

This implies that several multigigabit per second WDM channels spanning a wavelength 
separation of 1 or 2 nm may be used in a single fibre. 




t t t t 



A A A A 

< Collision length *■ 

Figure 15.15 Wavelength division multiplexing is possible with solitons of different wavelengths 
and velocities. These solitons do not interfere with each other so long as the collision length (twice a 
soliton length) is two or three times the lumped amplifier spacing (see Mollenauer et at., 1990) 
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In a conventional transmission line each channel must be isolated at the regenerative 
amplifiers and separately processed but one amplifier can handle all soliton channels and 
Erbium-doped amplifiers are powerful enough to do this. 

Random Noise Effects and the Frequency Sliding Guiding Filter There are two main 
sources of error which affect an optical soliton transmission system: fluctuations of pulse 
energy and arrival time at the receiver. Spontaneous emission (noise) always accompanies 
coherent Raman gain and at each amplifier, amplified spontaneous emission (ASE noise) is 
added to a soliton which can change its energy and its central frequency in a random way. 
The change of energy may affect the amplitude of a soliton and the accumulated effect may 
reduce a soliton to such an extent that its intended arrival as a ONE in the bit system is 
registered as a ZERO. Alternatively, amplified noise may register a ONE in a ZERO space. 
This contributes to the bit error rate (BER) which must be kept below the international 
standard of < 10 ~ 9 . 

The ASE change in the frequency of the soliton changes its velocity and therefore affects 
its arrival time, throwing the pulse out of its proper time slot. 

Amplitude and time jitter may be reduced by narrowing the bandwidth of the 
transmission line (Mollenauer, 1994), using a narrow band filter at each amplifier. Each 
filter is a low-finesse Fabry-Perot etalon (p. 343), centred on the true frequency peak of 
the soliton (Figure 15.16). A soliton whose frequency has been shifted from the filter peak 
suffers a loss across the spectrum provided by the filter. This, together with the non-linear 
effect which generates new frequencies, pushes the soliton back towards the filter peak. In 
this way, the noise-induced frequency shift is returned to zero rather than being maintained 
as it would in a broad-band transmission line. 

Amplitude jitter is damped because a pulse with excess energy will narrow in time and 
broaden in spectrum more than the average and will suffer a greater loss at each filter. 
However, the soliton loss at each filter must be replaced at each amplifier by an excess gain 
with a resulting growth in noise. 

Mollenauer et al. (1994), found that even when the soliton source laser was not tuned 
exactly to the filter peak frequency, the soliton was still guided rapidly on to the filter peak. 
The filter peak frequencies were therefore gradually slid with distance so that the soliton 
frequency followed the filters while the noise remained in its original frequency band and 



Etalon filter 




Figure 15.16 Noise effects in an optical transmission line are reduced using a narrow band Fabry- 
Perot etalon filter at each amplifier. The low finesse, R ~ 9%, of fixed frequency filters can be 
increased, R > 9%, if the frequency of the filters is gradually shifted with distance along the line. 
The soliton frequency has no difficulty in adjusting itself to this shift and noise is progressively 
reduced (see Mollenauer et at., 1994) 
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its growth was inhibited. This noise reduction allowed the etalon filters to be strengthened 
to a higher finesse. Experiments with a soliton pulse width of r ~ 16 ps, D ~ 
0.5 ps nm “ 1 km “ 1 , amplifier spacing = 26 km with one filter per amplifier, and a frequency 
sliding rate of 7 GHz 1 0 3 km gave a net frequency shift over 9000 km (trans-Pacific 
distance) of a few soliton bandwidths, i.e. 0.5 nm at A = 1557 nm. Such a series of sliding 
frequency etalon filters can operate over a range of wavelengths wide enough to allow 
several channels of wavelength division multiplexing. 



Problem 15.1 

The properties of a stationary gas at temperature To in a large reservoir are defined by c o, the 
velocity of sound, h o = c p Tq, the enthalpy per unit mass, and 7, the constant value of the specific 
heat ratio. If a ruptured diaphragm allows the gas to flow along a tube with velocity u, use the 
equation of conservation of energy to prove that 

cl _ 7+1 *2 

7-1 2(7-1) 

where c* is the velocity at which the flow velocity equals the local sound velocity. 

Hence show that if u\/c* = M * and u\/c\ = M s , then 



M* z 



(7 + \ )M Z 
(7- l)M s 2 + 2 



Problem 15.2 

Using a coordinate system which moves with a shock front of velocity u 1 , show from the 
conservation equations that c* in Problem 15.1 is given by 

C*^ = U i«2 

where u 2 is the relative flow velocity behind the shock front. 

Problem 15.3 

Use the conservation equations to prove that the pressure ratio across a shock front in a gas of 
constant 7 is given by 

P2 _ P - a 
P 1 l- Pa 

where P = pij p\, the density ratio, and a = (7 — 1)/ (7 + 1). 



Problem 15.4 

Use the results of Problems 15.1 and 15.2 with the equation of momentum conservation to prove that 
the shock front Mach number is given by 



u I 

M s = — = 

Cl 



v+ a 
1 + a 



where y = pi/pi, the pressure ratio across the shock and a = (7— 1)/ (7 + 1) - Hence show that the 
flow velocity behind the shock is given by 

ci(l - a)(y - 1) 

\J 1 ( 1 + a) (y + a) 



ll = 
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Problem 15.5 

The diagrams show (a) a shock wave of pressure p 2 and flow velocity u propagating into a stationary 
gas, pressure p\, and (b) after reflexion at a rigid wall the reflected wave of pressure p 3 moving back 
into the gas behind the incident shock still at pressure p 2 . Use the result at the end of Problem 15.4 to 
show that the flow velocity u r behind the reflected wave is given by 

Mr = (1 ~ a)(p3,/p 2 - 1) 

Cl sj ( 1 : J a){p3/pi + a) 

and since u + u r = 0 at the rigid wall, use this result together with the ratio for C 2 /C 1 = ( T 2 /T 1 ) 1 ^ 
to prove that 



P3 _ ( 2 a + l)y - a 
p 2 ay + 1 



where y = P2/P1 and a = (7 — l )/(7 + 1 ). 
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> 
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Problem 15.6 





Use Problem 15.5 to prove that the ratio 



P3 ~ Pi 



(b) 
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wall 



P2-P1 a 

in the limit of very strong shocks. (Note that this value is 8 for 7 = 1.4 and 6 for 7 = 5/3, compared 
with the normal acoustic pressure jump of 2 upon reflexion.) 



Problem 15.7 

Equation (15.9) evaluates u x for u =f(x — ut ). Obtain u, in a similar way and use this with equation 
(15.9) to prove equation (15.8). 



Problem 15.8 

Burger’s equation u, + uu x — vu xx = 0 where v > 0 is a special case. It has a second-order 
dispersion term and is directly integrable. Show that u = — 2 vip x /il> transforms Burger’s equation 
into the diffusion equation 

d z ip d 2 4 > 

dt 2 dx 2 

For fluids, v is a measure of viscosity which dissipates excess momentum in non-linear waves. 
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Problem 15.9 

Show that u(x, t) = 2a 2 sech 2 a(x — ct) is a soliton solution of the KdV equation 
u i + 6uu x + u xxx = 0 after calculating u t , u x and u xxx as shown in the text. 

Problem 15.10 

For small values of q, log (1 + <7) « q. Show that values of u(x, t) near the base of Figure 15.5(a) 
where uu x ~ 0 may be written 



u(x,t) ~ 2 ^2 e_2 “ (;c_c ' ) 

and that this satisfies the dispersion equation u r + u xxx = 0 if c = 4a 2 . 

Problem 15.11 

Use the method of Problem 15.9 to show that u(x) = —2a 2 sech 2 a(x — xo) is a solution of the KdV 
equation u , — 6 uu x + u xxx = 0. 

Problem 15.12 

Prove equation (15.13) if u = v 2 + v x . 

Problem 15.13 

Verify equation (15.14) for u(x) = v x + v 2 and v = 1 p x /ip. 



Problem 15.14 

Show that the wave function ?/> = A secha(x — xo) where A is a constant satisfies Schrodinger’s 
equation (15.15) when A = —a 2 . 

Problem 15.15 

KdV equations are invariant to a Galilean transformation. Show that the transformations u — > u — A 
where A is constant together with x — > x + 6A t returns u, + 6 uu x + u xxx = 0 to its original form. 



Problem 15.16 

At time 1 = 0 a high amplitude signal has a profile y = flsin7rx with dy/dt = 0. Thereafter, it 
propagates according to the non-linear wave equation 



d 2 y 

dt 2 




d 2 y 

dx 2 



where e is a small positive constant. 

Show that the time required for the leading edge of a positive signal to become infinitely steep is 
given by 

t = A/coean 2 

Hint : Rayleigh’s method (Rayleigh, Theory of Sound, Vol. 2, Dover Press p. 35), shows the required 
time to be the reciprocal of the maximum value of |dw/djc| where d u is the relative phase velocity 
between two points on the leading edge of a positive signal separated by a horizontal distance d.v. 
Note that waves propagate in the positive and negative x-directions. 
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Appendix 1: Normal Modes, Phase 
Space and Statistical Physics 



The last line of the introduction to the first edition states that ‘it is the wide validity of 
relatively few principles which this book seeks to demonstrate’. Here we apply that concept 
to the relationship between normal modes which feature in most of the book, phase space 
of the final chapter, and statistical physics. 

Firstly, we wish to show that the expression for the number of normal modes per unit 
volume in the frequency range u to v + du given on p. 253 as 

4ttu 2 Au 

An = r — 

c i 

is nothing more than the number of ‘cells’ of phase space per unit volume in the same range 
v to v + Au available to particles in a statistical distribution. 

Moreover, we can easily convert this expression in the frequency u to one in the velocity 
v, the momentum p = mv or the energy E. 

The particle may be a molecule in the classical Maxwell-Boltzmann distribution 
(M-B ), a fermion of half integral spin in the quantum Fermi-Dirac distribution (F-D) or 
a boson or any other particle of integral spin in the quantum Bose-Einstein distribution 
(B-E). Bosons are the messengers of the force fields in physics, e.g. the photon in the 
electromagnetic field. 

We shall see that each of these distributions is nothing more than the statement that 

7i, — gi x probable occupation of the phase space cell. 

Here ??, is a number of particles in the distribution and ", is our expression 4irv 2 Av/c 2 (or 
its equivalent). 

The expression for ", is common to all three types of distribution but the occupation 
factor or relative probability of occupation depends on the way in which the particles are 
allowed to distribute themselves. 

Firstly, let us examine the various equivalent forms of g,-. We write 

gi — gi{v) Au = 4nu 2 Au/c 3 
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Appendix 1: Normal Modes, Phase Space and Statistical Physics 



as the number of phase space cells per unit volume in the frequency range v to v + die For 
a quantum particle (p. 415) the momentum p = Tik = hv/c where h is Planck’s constant, k 
is the particle wave number = 2 tt/A and c is the velocity of light, so 

gi = gi(p) d p = 4np 2 dp/h 3 

is the number of phase space cells per unit volume in the momentum range p to p + dp. 
Note that 4np 2 dp is the volume of the shell in momentum space between spheres of radius 
p and p + dp. 

All particles in statistical distributions are required to be free particles, that is having 
only kinetic energy with no potential energy interaction terms. 

Thus, the energy of a particle E — \ mv 2 = p 2 /2m where p = mv, m is the particle mass 
and v is its velocity. Now 

p 2 dp = (2m 3 ) 1 / 2 £' 1 / 2 d£ = m 2 v 2 mdv = m 3 v 2 du 



so 



gi = gi (E) d E = 4tt( 2 m 3 ) 1/2 £ ! / 2 d E/h 3 

is the number of phase space cells per unit volume in the energy range E to E + d E and 

gi = gj(v) du = 47rm 3 u 2 dv/h 3 

is the number of phase space cells per unit volume in the velocity range v to v + di>. 

Although we used the phase space of x or v with x in our discussion of chaos, the phase 
space of mv or p with x is much more commonly used in physics. The phase space of ( p. x) 
reveals the significance of h 3 in the denominators of g , . Consider the expression 

4irp 2 dpV/h 3 

where V is the total volume (not the unit volume) so that the numerator expresses the phase 
space over the momentum range p to p + dp and the volume V — xyz of the system. 

Heisenberg’s Uncertainty Principle, p. 416, tells us that AxAp ~ li, so we may write 
(AxAp x )(AyAp y )(AzAp z ) as h 3 \ that is, the ‘volume’ of a cell in (p, V) phase space. 
This volume is the smallest acceptable volume which a particle may occupy for it defines 
the volume associated with a particle as 

AV~ 

A,,) ~ 

where Adb is the de Broglie wavelength of the particle (p. 412). 

So gi measures the number of phase space cells each of ‘volume’ h 3 per unit volume in 
the range p to p + dp. Each of these cells may or may not be occupied by a particle. 

We now examine what we mean by a statistical distribution in order to find the probable 
occupation of a cell. This occupation factor is different for each of the three distributions 
M-B, F-D and B-E. 
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We consider a system, say a gas, of IV particles occupying a volume V and having a total 
internal energy E. The macroscopic parameters E, V, N define a macrostate. The energy E 
may be partitioned in many different ways among the N particles subject only to the 
restrictions that E=Y2 n i £ i and iV = Y2 n ‘ rema in constant where £, represents the 
energy levels available to the particles. The probability of a system being found in a 
particular partition is proportional to W the number of ways of distributing the energy 
among the particles to achieve that partition. 

Each different way is called a microstate and each microstate has a priori the same 
probability. Each microstate contributes to the statistical weight of a partition so that the 
particular partition reached by the greatest number of ways has the greatest statistical 
weight and is therefore the most probable. The most probable partition with W (maximum) 
defines the equilibrium of the macrostate and is written 0 ( EVN ). 

It is here that we relate (EVN) to the concept of entropy S. Entropy is a measure of the 
disorder of a system which increases as the system tends to equilibrium. At constant 
temperature and volume the internal energy E of the system may be written 

E = F + TS 

where T is the temperature, S is the entropy and the product TS is a measure of the energy 
of the system locked in the disorder amongst the particles and not available for work. F is 
defined as the Helmholtz free energy and measures the work which can be done by the 
system at constant temperature. At best, in an ideal reversible thermodynamic process 
the disorder energy TS remains constant, but in a natural or thermodynamically irreversible 
process TS increases at the expense of F as E remains constant. 

An isolated system in equilibrium with the most probable partition of its energy among 
its particles represents a maximum of its entropy S and Boltzmann related S and Q through 
his expression S = k log Q where k is Boltzmann’s constant. Fluctuations from the 
equilibrium position are very small indeed and log 0 is a very sharply defined function. 

Calculating the value of W the statistical weight of a partition in order to find W 
(maximum) = 0 (EVN) for each of the three distributions is a mathematical exercise which 
is straightforward and a little tedious but which fails to reveal the underlying physics. 

We shall make these calculations at the end of this appendix but we adopt the procedure 
of quoting the results below together with the forms in which we usually meet them. This 
will raise questions the answers to which are not evident in the mathematical derivation 
(Table Al.l). 

For all three distributions the particles are identical and indistinguishable, the total 
energy E and number of particles N are constant. There are no restrictions on the number of 
particles having a particular energy in the M-B and B-E distributions but in the F-D 
distribution, Pauli’s exclusion principle allows only one fermion per energy level (or two if 
we include spin). 

Note firstly that the occupation factor or relative probability of occupation for each 
distribution includes the term e“ +/3e; , where a and ,6 arise as multipliers in the 
mathematical derivation. The index of the exponential requires (3 to be the inverse of an 
energy and the relevant term in the normal form of the Fermi-Dirac distribution suggests 
that a is the ratio of two energies. 
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Table Al.l The mathematical derivation for each statistical distribution in the left hand column is 
compared with its more familiar form on the right 



/?,- = gi x occupation 
factor 



M-B 



F-D 



1 



= gi 



= g i e- a - (k 



e a+fie , 

fie, 

1 



M/ = gi 



%+fie , -(- 1 



B-E 



»i = gi 



1 

£ a+fie i _ ! 



Normal form 



n_ = 4irp 2 dp ^ - p i/ 2m kT 

N (2mnkT) 3 ^ 2 



. lAnVllm 3 ) 1/2 £: 1 / 2 

n(E)dE = ^ 

n{v) d vhv = E( v) dv 
IAtw 2 dvhv 1 

^3 ^ f±hv/kT | 



(p = mv) 

1 

e (e,-e F )/kT _|_ l 



Planck’s radiation law 



In comparing the two columns of the table several questions arise: 

1. Is p= 1 /kTl 

2. What has happened to the a term in the normal form of M-B? 

3. What is the physical significance of the a term? 

4. What has happened to the a term in Planck’s radiation law? 



In question 1 let us integrate by parts the expression 






:•** i* dp 

op 



where 



e = p 2 /2m 



For s — » oo as p — > ±oo the first term on the right hand side equals zero, leaving 



1 

P 



de _ 0£ 

p— e dp 
op 

p 



de 





the average value of 
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From the equipartition of energy 



&E P 2 rr, 1 

P-S~ = — = kT = -3 
op m p 



where kT is the average energy per particle. 

In question 2 we note that the term e in M-B has been replaced by N / (iTunkT) 2 ' 2 
and that h 3 has been lost from the denominator of g,(p)dp. To explain this and its 
consequences let us write not n per unit volume but n p in the range p to p + dp over all 
V = xyz as 



V4np 2 dpe- p ^ 2mkT 



Then 

00 4irp 2 dpe- p2 / lmkr 



where the standard definite integral is well known to have a value of (liunkT ) 3/2 . 

Thus 

N = V(2irmkT) 3/2 /h 3 

Now the average particle momentum p = mv where \ nw 2 — kT (v is the most probable 
velocity). 

Hence 

(27 vmkT) 2 ^ 2 w p 3 



N = Y j n P =V 



Thus, ( V/N)p 3 replaces e“/z 3 and 



Vp^ = VJ_ 

Nh 3 NX 2 db 

Volume available to each particle 

Volume associated with the thermal de Broglie wavelength of the particle 

The value of e a = 0.026 m 3 / 2 T 5 / 2 at a pressure of one atmosphere, where m is measured in 
a.m.u. (O 16 = 16). 

For air at STP e“ ~ 10 6 so for the Maxwell-Boltzmann distribution 

Si_ = e a+0e, ^ 1q 6 £ Si/kT > | 
n i 

This states that there are many more states or cells available for occupation than there are 
particles to fill them, so the probable occupation of each cell is very small. This defines a 
classical distribution. 
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For the Bose-Einstein gas He 4 at 4 K and one atmosphere pressure e a ~ 7.5 so the gas 
is not safely classical. 

Although it is not strictly applicable, for electrons in a metal at 300 K, e“ ~ 10 4 so the 
classical description for the Fermi-Dirac case is totally invalid. 

A distribution which is not classical is said to be degenerate. Note that for high enough 
energies (temperatures) all three distributions become classical. 

Before we examine the origin of a and its physical meaning let us note that a factor 2 
appears in both the F-D and B-E distributions where each particle has two spin states for 
each energy level which must be accounted for. In Planck’s radiation law these spin states 
are equivalent to the polarization states of electromagnetic waves. Note also in Planck’s law 
that E(y) dv, the energy per unit volume in the frequency range v to v + dn, is n(v) didw 
where hu is the photon energy. 

Turning to question (iii) on the significance of a we again use the expression S = k log 0 
or 0 = e s ^ k . Consider a system in contact with a large reservoir at constant temperature. 
Figure Al.l, able to exchange both energy and particles with the reservoir. The 
combination of reservoir and system is isolated and its energy E, volume V and total 
number of particles N are all fixed and constant. 

We ask 'What is the probability of finding the system in a particular microstate with n 7 
particles having total energy ep.' This will be proportional to the number of microstates in 
the reservoir after n , and s ; are supplied to the system. 

The entropy equation with subscript R for reservoir becomes 

S„(E - t „H S,(£. *)- 



where we neglect higher terms in the expansion. 
Elementary thermodynamics shows that 




and 



/dS\ = —p 
\9NJev T 



T 






system 




reservoir 



Figure Al.l When a system, surrounded by a Large reservoir with constant N, 1/ and E receives nj 
particles and total energy sj from the reservoir, the entropy change of the reservoir is AS = 
( rijp — £j)/T where p is the chemical potential 
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where /i is called the chemical potential. The chemical potential // is the free energy per 
particle lost when the entropy S is increased in the relation E = F + TS where E is 
constant. Thus, the entropy change may be written 

AS = Sr(E — £j,N — nj) - S r (E, N) = - 1 ^ 

Because the statistical weight 0 ( EVN ) represents the probability of a partition, the 
probability of the combination of two partitions may be written as the product of their 
statistical weights so we have 



<>(/•; - ej.N - nj ) = q(e,n) n(£j. nj ) 

= 0 {E,N)e AS/k 

= 0 (E,N)e {n ^~ E JV kT 



In order to show the relation between a and —p/kT, we take as an example a system of 
four fermions available to occupy any of four single particle energy states e\, ei, £ 3 , £4 
(Table A1.2). The particles and energies are supplied by the reservoir and each energy level 
may be filled or empty. The numbers of possible microstates of the system using 0, 1,2,3 
or 4 particles are shown below together with their relative probabilities. 

For any microstate in which a particular energy level is filled we can find another which 
differs only in having that energy level empty. 



Table A1.2 Distribution of four fermions among four single particle energy states with numbers of 
possibLe microstates and their relative probabilities 





No 


One 


Two 


Three 


Four 




particles 


particle 


particles 


particles 


particles 


Number of 
microstates 


1 


4 


6 


4 


1 


Energy 
level £4 


0 


0 


0 


0 


1 


Energy 
level £3 


0 


0 


0 


1 


1 


Energy 
level £2 


0 


0 


1 


1 


1 


Energy 
level £ 1 


0 


1 


1 


1 


1 




nj = 0 


n j = 1 


n i = 2 


tij = 3 


n i = 4 




£j = 0 


(JO 

II 

U) 


£j = £\ +£2 


£j = £ 1 +£2 + £3 


Ej = £\-\-E 2 J r£?>-\-£A 


Relative 
probability 
of micro- 
state 


e ( 0-0 )/kT 


e (r<-ei )/kT 


gP/i-tfU+E l)]/kT 


e [ 3 ix-(ei+e 2 +ei)\/kT 


0 [ 4 /i— (e i+£' 2 +£ 3+£'4 )\/k-T 
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Thus, for example 

Relative probability of finding £3 filled p 
Relative probability of finding £3 empty 1 — p 

e [3/*- (e 1 +e 2+e 3 )] /kT 
e [2/i-(si +£ 2 )\/kT 
— e (M-£3 )/kT 



More generally 



P — e (p~ei)/kT 

1 ~P 



SO 



1 _ ^ 

P ~ e ( £ i ~d)l kT + 1 — n ‘ 

where 72, = gjhj and h , or the relative probability is the average occupation of a cell. 

This is the Fermi-Dirac occupation factor and we can identify a = —p/kT (the ratio of 
two energies) where p is the chemical potential. For the Fermi-Dirac distribution 77 , < 1 
and Figure At. 2 shows 72 , versus e for electrons in a metal at T = OK. 

Each energy level is occupied by one electron until the top energy level £ F the Fermi 
energy level is reached. At T = 0 K the electron with £ F is the only one capable of moving 
to change the entropy of the system and we identify its free energy with that of the 
chemical potential p. Note that, at £ F for T> 0 , n i = 2 and this is indicated by the dotted 
curve at £ F in the tj, versus £ graph. 

We may apply a similar procedure to particles obeying Bose-Einstein statistics where 
there is no restriction on the number of particles ;i, in the energy level £,. If 77 , can take any 
value, three identical bosons available to three energy levels (£ 1 , £2> ^3) can form the 



1 



Vz 



\ 

\ 

\ 



\ 




€ 



Figure A1.2 Occupation number rij versus energy e for electrons in a metal at T = 0 K (solid line). 
A slight increase in T permits the electrons near e f to move to higher energy levels (dotted curve) 
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microstates (3, 0, 0) (0, 3, 0) (0, 0, 3) (2, 1, 0) (0, 2, 1) (1, 0, 2) (0, 1, 2) (2, 0, 1) (1, 2, 0) 

(1, 1, 1). The energy of each microstate is given by Ej = ^ n,e, with nj — ^ n t . Suppose, 
as before, a large reservoir at temperature T surrounds a system to which it can supply 
particles and energy. 

We consider a particular microstate of the system with ni,n 2 ,n^ . . .«,■ particles in the 
various energy levels to have a probability p when n , = 0. 

If the system now takes n, particles each of energy e, from the reservoir the probability 
of the microstate (now with n t =/=■ 0) is given by 

p e m{ii-£i)/kT __ p e u jX 



where x = (p — £i) jkT . 

The total probability for the microstate with n, = 0, 1, 2, 3, . . . is 



1 = 




P 

(1 — e ' T ) 



because e"'* is a geometric progression. 
Hence 



P={ 1-e*) 



The average value 



Hi = 



i i — 

E 



n j = 0 



riipe ‘ 



But 



Therefore 



E 



n ,-e 



rijX 



d y c »,.v d 1 
dx^—^ dv(l — e x ) (l — e- v ) 2 



pe x (1 — e x )e x e x 
U ' ~ (l-e x ) 2 ~ (l~e x ) 2 ~ (1 -e r ) 

1 _ 1 

~ e~ x - 1 ~ Q{£i-v)l kT — 1 

The general expression for the Bose-Einstein distribution is therefore 

1 

n i — Si n i — gi x | 

Finally we discuss the absence of a or —p/kT in Planck’s radiation law, noting that this is a 
special case and that —p/kT is retained in other applications of Bose-Einstein statistics. 
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Black body radiation is an equilibrium process, so that the system or cavity of a box of 
photons is in equilibrium with the reservoir at temperature T, the entropy S is a maximum 
and this process results from the continual emission and absorption of photons by the walls 
of the cavity. The number of photons in the cavity is not conserved, the energy requirement 
could be satisfied by a few high energy photons in the 7-ray region or by many photons in 
the low energy infrared frequencies. This means that the occupation numbers are not 
subject to the constraint which specifies the total number of particles in the gas. 

Since N is not fixed, the entropy S of the reservoir is not affected by the nj photons in the 
exponent n/p of the occupation factor for a given microstate; n, has no role and /;,// = 0 
giving p = 0. 

The graph of the entropy S versus N, the total number of particles, gives low S values, 
that is few microstates or particle arrangements at low N (7-rays) and also at high N 
(infrared) photons. 

A typical microstate for 7-rays occupying the energy levels e, would read 
n 1 = 0 , H2 — 0 , n 3 = 0 with oo 7^ 0 
and for infrared photons a typical microstate would read 

n 17^0 n 2 — O «3 = 0 

Both of these are extremely unlikely and would contribute to partitions of low statistical 
weight. 

At equilibrium the maximum of the S versus N curve occurs at that value of N providing 
the greatest number of microstates and here 




again giving p = 0. 



Mathematical Derivation of the Statistical Distributions 

The particles are identical but distinguishable by labels. All energy e states are equally 
accessible and have the same a priori probability of being occupied. The statistical weight 
or probability of a particular partition is proportional to the number of different ways of 
distributing particles to obtain that partition. 

Maxwell-Boltzmann Statistics 

We start by filling the e 1 states with n 1 particles from the constant total of N particles. We 
can do this in 

N\ 

n 1 ! (N — n 1 ) ! 



different and distinguishable ways. 
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We now fill the £2 state with 112 particles from the N — n\ remaining particles. This 
gives 



( N — n\)\/n 2 \(N — n\ — ri 2 )\ 



different and distinguishable ways. 

Proceeding in this way for all remaining energy states we have 

N\ 

W = 

u 1 ! n 2 ! n 3 ! . . . 

as the number of different and distinguishable ways of choosing n\,n 2 ,n^, . . . from the N 
particles. Particles with the same e, may have g, differing amounts of angular momentum, 
etc. This will give g, cells associated with e, in each of which a particle with e, may be 
located. If «, is the probability of having one particle in the e, range of cells then 
gi x gj = gj is the probability of two particles in that range and g"' is the probability of n, 
particles with e, being in that range. 

Hence the total number of different distinguishable ways is 

AT\ Wl ft 2 ft3 

w _ Nl Si S2-83 3 ■■■ 

1 ! ^ 2 ! ^ 3 ! • • • 



The particles are distinguished by labels and if we now remove the labels and the condition 
of distinguishable particles, we cannot recognize the difference in the partition when 
particles are exchanged. Therefore all N\ permutations among the particles occupying the 
different states give the same partition with the total number of ways 



W = 



n 1 ft 2 w 3 

gj g 2 ~£3 
n 1 !«2^3! • 



We now maximize log W with the constraints that 



1. The number of particles N — n t = constant so that d N — d n , = 0. 

2. The energy E = YL n < £ < = constant so that d E = ^ e: , d /r , = 0. 



log W = i log gi - log «,■!) 



where for large n, Stirling’s formula gives 

log n,! = n j log n ,■ - n, 



Hence 



logw^y n, iog^+y m 

L — / n, A ^ 




544 



Appendix 1: Normal Modes, Phase Space and Statistical Physics 



and 



d (log W) = ^ d/i, log 'j + ^2 zi/dlog ^ + ^2 d 



\n t 



= ^ dn, -log 
(because g,- 
= J2 d/i,- log 



Si \ d/Ij 

— - >. «< — 
„ / z — / n 



is constant and d/i, = 0) 




If ^ dn i = 0 then —a Y dn , = 0 and 
if X] e, d/i,- — 0 then — /? Y £ i d /? / = 0 
where a and f3 are called Lagrange multipliers. 

Adding these constraint conditions to d(log W ) gives 



d (log W) = ^2 d/i/ 




a — fai 



Maximizing IV gives d(log W0 = 0 which, since all the coefficients d//, are arbitrary and 
independent, leaves 



log 




— a — /?£,- = 0 



for each /i, . 

At lV m ax we have therefore 



n, = gi x 



1 

0 CX-\-/3£ i 



Fermi-Dirac Statistics 

We begin again with labelled identical particles. Here the Pauli exclusion principle 
operates and no two particles may occupy the same state. The «, are quantum states, e.g. 
spin gives a factor 2 to each g- t . Also «, gives the maximum number of particles with e, so 
«/<£;• 

To fill the 8 j states with /i, particles we put one particle in a g,- cell and the next particle 
in any of the ( g f — 1) remaining cells. We can do this in g,-(gj — 1) ways so the total 
number of ways of filling the states of energy e, with /i, particles is 

gi(gi - l) . . . (gi - m + l) 

g,! 

(gi ~ nt)\ 

If now the labels are removed and the particles become indistinguishable we reduce the 
total of different distinguishable arrangements to g,!//ii!(g/ — «/)!. 
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Applying this to all g,- gives the total number of different distinguishable ways as 

w = §i- gfl # 3 ! 

nf.(gi — n\)\ n 2 \(g 2 -n 2 )\ nf.(g 3 - n 3 )! 

Maximizing log W with ^ d/t,- = ^ e, dn, = 0 we proceed as with the Maxwell - 
Boltzmann example to obtain for W (max) the condition that 



log 




— a — 0£j — 0 



to give 



n, — gi x 



1 

e a+0£i + 1 



Bose-Einstein Statistics 

Here there is no exclusion principle and we begin again with labelled identical particles. 

The number of distinguishable arrangements of n, particles in the g,- cells of energy s, 
equals the number of ways of putting «,■ objects in g , boxes with any number allowed in a 
box. This means putting /r, particles in a row separated by gi — 1 walls so that the number 
of ways is the number of permutations of («, + g , — 1) objects, i.e. particles and walls. 
This gives ( 72 , +g, — 1)! ways. If we now remove the particle labels to make them 
indistinguishable we reduce the number of ways by a factor of n\ to give ( 77 , + g, — 1 ) ! / 77 ,- ! 
ways. 

However, all permutations of the g, — 1 dividing walls among the 72, particles give the 
same physical state, so the number of different distinguishable ways is given by 
( 7/ , -T g i — 1 ) ! / n i ! (g i — 1)! and for all particles we have the number of ways 

= (m - gi - 1) {n 2 + g 2 - 1) 
mKgt-1) n 2 \(g 2 -l) 

Maximizing log W as for the other two distributions gives d(log W) = 0 when 



log 




— a — (3ej = 0 



that is, when 



1 



= gi 



e a+/ 3 e f _ J 




Table A1.3 Waves incident normally on a plane boundary between media of characteristic impedances Z\ and Z 2 

Amplitude Coefficients 

j d Z Reflected r Transmitted, Reflected r Transmitted , 

+ ve for wave in Incident, Incident, Incident, Incident, 
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Appendix 2: Kirchhoff's Integral 
Theorem 



Kirchhoff’s Integral Theorem is valid for any solution E of the scalar time independent 
Helmholtz equation (3), p. 187, that is 



<9 2 E 

Ih 2 



+ k 2 E = 0 



For the radial direction r in a spherical coordinate system this becomes 

d 2 E 2<9E_ 

~dr I + ^lk~ 



which is satisfied by 

E = — e' kr 
r 

where Eo/r is the amplitude at a distance r from the origin O of a spherical 
electromagnetic wave. We note that the amplitude of such a wave decays as 1/r where 
r is the distance from O. 

Kirchhoff’s Theorem states that the complex amplitude E/> at a point P is related to the 
complex amplitude E on a surface S enclosing P by 



E P = 



1 

4-7T 



' / d e' kR 

A dn R 



e ikR 5E\ 
-Rlki) 



d5 



where R is the distance from P to the surface element dS and n is the direction normal to dS 
(Figure A2.1). 

If r is the distance from O to d S, then 



E = 



E o 



e 



i kr 
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Figure A2.1 0 is the origin of an electromagnetic wave. Kirchoffs Theorem relates its complex 
amplitude E p at a point P to the complex amplitude E on a surface 5 enclosing P 



and 



9E 

dn 



y e^i cos (n, r) 



The term (ik — 1/r) shows that inside S there is a phase shift of 7r/2 rad and an amplitude 
factor 1/r. However, for r = mX, where m is large, then 

2tt 1 1 

k =x >> - r =^\ 

so that 1/r may be neglected for distances much greater than A. 

Similar arguments hold for 



d t lkR 
dn~R~ 

Thus, if P and O are many wavelengths from S , Kirchhoff’s integral becomes 



e ik(r+R) ( cosni R _ CO sn, I - ) 

^ Cl»3 

rR 2 

where the cosine terms generate an inclination factor K(x) a nd cos(n.R) = cos ;y. 

The problem of showing that Huygens wavelets on an unobstructed wavefront do not 
propagate backwards reduces to that of demonstrating that K(x) can be zero. This occurs 
where 




cos (n, R ) = cos 7T = — 1 




Appendix 2: Kirchhojf’s Integral Theorem 



549 




Figure A2.2 When P' is Located on r within the surface of the spherical wavefront Si, situated 
within S, E pi is reduced to zero proving that Huygens wavelets do not propagate backwards 



and 

cos (n, r) = cos 7r = — 1 
This is achieved in the following way. 

The surface S designated So now encloses a spherical wavefront surface Si centre O. Si 
and S 2 are said to be doubly connected and the surface integral now includes Si and So 
(Figure A2.2). At Si the normal n to dS on So now points towards O and if the outer 
surface of S 2 is allowed to expand to infinity its contribution to the integral becomes zero. 
This leaves only the integral over the surface where Si and S 2 coincide. The singularity 
Eo/r at O is excluded from the integral. 

If P is now located on r, at P', that is in the direction of backward propagation of 
Huygens wavelets, then 



cos (n, R) = cos 7T = — 1 



and 



cos (n, r) = cos zr = — 1 

K(x) is th en equal to zero. Any other position for P gives 

x cos x - cos (n, r) 1+cos* 

K KX) = = 



2 



2 




Appendix 3: 

Non-Linear Schrddinger Equation 



This equation describes phenomena in non-linear media with strong dispersion. It appears 
in several forms. For optical soliton purposes, Mollenauer et al. (1982) derive it from the 
equation 



where 



( du 



+ k 




—k 2 d 2 u 
2 dt 2 



T 7|m| 2 u 



(A3.1) 



dk 
du> ’ 



ki 



d 2 k 

du 2 ' 



and 




«2 

no 



n 2 and n o appear in the Kerr Optical Equation n — no = nil. 

Equation (A3.1) is satisfied by a pulse of the form 

E(z, i) = u(z, t) e^-M 

Using the transformation of Mollenauer et al. (1980), (A3.1) assumes the dimensionless form 



. dv 









ld 2 v . l2 
2 d? + '"^ V 



(A3. 2) 



which has a soliton solution u(£,s) — sech (.v)e'^ /2 where 

s = T~\t-kiz) C = \k 2 \T~ 2 z 

and 




where T is a measure of the width of the input optical pulse. 

The first term on the right hand of equation (A3. 2) describes the effects of dispersion 
which may be seen as the kinetic energy term in the linear Schrodinger equation, while the 
second term corresponds to the energy of a self-trapping potential proportional to \u\ 2 
arising from the non-linear refractive index which may be interpreted in probability terms. 
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Absorption of wave energy, 185, 191, 209 
Acoustic waves, 151 
finite amplitude, 505 
Airy disc, 385 

Anharmonic oscillations, 459 
Attenuation coefficient, 185, 209 

Band Theory, 445, 448 

Bandwidth Theorem, 132, 285, 376 

Beats, 14, 83, 130 

Bessel’s functions, 381 

Bloch functions, 441 

Bragg reflection, 447 

Brewster angle, 220 

Brillouin zone, 137, 445 

Bulk modulus, 152 

Cerenkov radiation, 508 
Chaos, 

attractors (chaotic), 

Lorentz, 500 
Ueda, 487 

basin of attraction, 485 
Cantor set, 491 
Couette flow, 495 
electrical oscillator 
non-linear, 477 
relaxation, 467 
Feigenbaum limit, 474 
fractal, 490 
in fluids, 494 
Koch snowflake, 490 



laser-Doppler technique, 495 
limit cycle, 469, 485 
logistic equation, 469 
manifold, 487 
period doubling, 469 
pitchfork bifurcation, 47 1 
phase space, 481, 533 
Poincare section, 486 
population biology, 469 
Rayleigh-Benard convection, 497 
Rayleigh criterion (flow instability), 
497 

Rayleigh number, 499 
repellor, 469, 485 
Reynolds number, 496 
saddlepoint, 483 
separatrix, 483 
Smale horseshoe, 493 
stable point attractor, 469 
Taylor cell, 496 
Van der Pol equation, 467 
varactor, 477 

Complex number notation, 26, 53 
Convolution, 292 
Array Theorem, 388 
Theorem, 297 
Cornu spiral, 396 
Coupled oscillations, 79 
electrical, 87 
on a loaded string, 90 
spring-coupled pendulums, 79 
wave motion as the limit of, 95 
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Index 



Criterion for dielectric-conductor 
behaviour, 212 

Cut off frequency, 95, 244, 355 

Damped simple harmonic motion, 37, 41 
critical damping, 40 
dead beat damping, 39 
logarithmic decrement, 44 
oscillations, 41 
rate of energy dissipation, 47 
relaxation time, 45 

De Broglie wavelength, 412, 534, 537 
Debye theory of specific heats, 253 
Decibel, 158 
Degeneracy, 250, 425 
Deviation by a prism, 312 
by a lens system, 317, 322 
Diffraction, Fraunhofer, 367 
circular aperture, 379 
far field, 383 
rectangular aperture, 377 
single narrow slit, 367 
transmission grating, 373 
Diffraction, Fresnel, 395 
circular aperture, 401 
Cornu spiral, 396 
slit, 395 

straight edge, 395 
zone plate, 402 
Diffusion equation, 187 

added to wave equation, 190, 209 
Dipole radiation, 362 
Dirac 5 function, 292 
Fourier transform, 292 
sifting property, 292 
Dispersion, 

anomalous, 131, 522 
normal, 130, 515 
Displacement current, 201 
Doppler effect, 141 
shock waves, 506 

Earthquake, 161 

Eigenfrequencies, 86, 125, 245, 418 
Eigenfunctions, 418 
Electromagnetic waves, 199 
in a conductor, 208 
in a dielectric, 202 
in the ionosphere, 227 



in plasma, 223 

Electron waves in solids, 441 
Energy, 

density in an electromagnetic wave, 208 
distribution in a sound wave, 155 
distribution in a velocity pulse, 278 
in harmonic mode of a vibrating 
string, 126 

Evanescent wave, 256 
Exponential series, 25 

Fabry-Perot interferometer, 341 
central spot scanning, 346 
filter, 527 
finesse, 345 

free spectral range, 345 
resolving power, 343 
Fermat’s Principle, 307 
Fermi energy level, 426, 540 
Forced oscillator, 
electrical, 55 
mechanical, 57 
power supplied to, 68 
steady state behaviour, 58 
string as a forced oscillator, 115 
transient behaviour, 58, 74 
Fourier Integral, 283 
Fourier Series, 267 

application to plucked string, 275 
application to velocity pulse, 278 
frequency spectrum of, 281 
Fourier Transform, 285 

application to Gaussian function, 289 
application to optical diffraction (one 
dimension), 287 

application to optical diffraction (two 
dimensions), 378, 379 
application to slit function, 286 
Fraunhofer diffraction, 367 
Fraunhofer far field diffraction, 383 
Fresnel diffraction, 395 
Fresnel's equations, 220 

Group, 

many components, 132 
two components, 128 
velocity, 109, 130 
wave group, 128 
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Heisenberg's Uncertainty Principle, 135, 414 

Helmholtz equation, 187 

Helmholtz equation (optical), 313, 321 

Holography, 403 

Huygens wavelets, 305, 547 

Impedance 

characteristic of string, 117 
characteristic of transmission line 
(lossless), 175 

characteristic of transmission line 
(real), 186 
conductor, 215 

connection with refractive index, 220 
dielectric, 207 

forced oscillator (electrical), 55 
forced oscillator (mechanical), 57 
quarter wave matching, 124 
specific acoustic, 158 
Instantons, 521 

Intensity of sound waves, 157 
Interference, 

amplitude division, 333, 334 
dipole radiation, 362 
linear array of N sources, 363 
missing orders, 373 
spatial coherence criterion, 360 
two sources, 355, 357 
Young’s slit experiment, 357 
wavefront division, 333, 355 
Interference fringes, 358 
of constant inclination, 335 
of constant thickness, 336, 355 
Newton’s Rings, 337 
visibility, 360 
Interferometer 
Fabry-Perot, 341 
Michelson’s Spectral, 338 
Resolving power, 343 
structure of spectral lines, 340 
Ionic crystal 

infrared absorption in, 140 
thermal expansion in, 463 
wave propagation in, 138 
Ionosphere, 227 

Kerr optical effect, 522 
Kirchhoff Integral Theorem, 547 
Kronig - Penney model, 441 



Lame’s elastic constants, 159 
Laser cavity, 347 
Line spread function, 392 
Lissajous figures, 19 
Logarithmic decrement, 44 

Mach Cone, 507 

Mach number, 510 

Magnification by spherical surface, 

316 

Magnifying glass, problem 328 
Matrix applications 

coupled oscillations, 86 
lens systems, 325 
multilayer dielectric films, 350 
Maxwell’s equations, 202 
Michelson’s spectral interferometer, 338 
Michelson’s stellar interferometer, 386 
Microscope, problem, 330 
Modulation transfer function, 391 
Multiplexing, 526 

Newton's optical equation, 320 
Newton's Rings, 337 
Non-linear oscillations, 459 
restoring force, 460 
Non-linear waves, 505, 514, 515 
Normal coordinates, 81 
Normal frequencies, 81, 86 
Normal modes, 81 

one dimension, 81, 125 
three dimensions, 250 
two dimensions, 246 
Normalization of wave functions, 423 

Optical fibre wave guide, 353 
Optical Helmholtz equation, 313, 321 
Optical reflection and refraction, 

217, 254, 307 
Optical system, 313 
Optical transfer function, 391 
Oscillations 

anharmonic, 459 
coupled, 79 

damped simple harmonic, 41 
electrical relaxation, 465 
finite amplitude, 459, 505 
non-linear, 459 
simple harmonic, 1 
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Paraxial rays, 313 

Partial differentiation (notation), 96, 

107 

Particle velocity, 109 
Phase transfer function, 391 
Phonons, 450 
Pinch effect, 226 

Planck's Radiation Law, 251, 536, 541 
problem, 262 
Plasma, 223 

Point spread function, 391 
Poisson’s ratio, 159 
Polarization, 17 
Power (optical), 

of one spherical surface, 314 
of thin lens, 318 
of two spherical surfaces, 317 
Poynting’s vector, 206 
Propagation constant, 185 

Quality factor Q, 45 

connection with Resolving Power, 377 
of an oscillator, 70, 71 

Radio transmission and reception, 229, 362, 
366 

Raman effect, 524 
Reciprocal lattice, 452 
Reduced zone scheme, 445 
Reflection and transmission of waves at 
a boundary 
acoustic, 163 

electromagnetic by a conductor 
(normal incidence), 222 
electromagnetic by a dielectric (normal 
incidence), 217 

electromagnetic by a dielectric 
(oblique incidence), 218 
electromagnetic (optical laws), 

254, 307 

electromagnetic (total internal), 256 
on a string, 117 
on a transmission line, 177 
quantum particles at a potential 
barrier, 419, 427 
summary (table), 546 
three-dimensional, 254 
two-dimensional, 242 



Relaxation time, 45, 214 
medium, 214 
oscillator, 45 
Resolving Power 

Bandwidth Theorem, 376 
diffraction grating, 374 
Fabry-Perot interferometer, 343 
Rayleigh's Criterion (optical 
resolution), 375 

Scattering 
elastic, 447, 451 
inelastic, 451 

Schrodinger’s wave equation, 417 
Separation of variables (method of), 
245 

Shock waves, 506 
Simple harmonic motion, 1 
Skin depth, 211 
Snell's Law, 256, 309 
Solitons, 513 
instantons, 521 
KdV equation, 515, 517, 518 
Kerr optical effect, 522 
Miura’s transformation, 520 
multiplexing, 526 
non-linear waves, 514 
optical, 521 
Raman effect, 524 
Rayleigh’s solution, 513 
Schrodinger’s equation, 520 
shallow water waves, 513 
transparency in collisions, 518 
Sound waves, 151 
Spatial coherence criterion for 
interference, 360 
Standing wave, 

energy in harmonic mode, 127 
equation, 124 
on a string, 124 
ratio, 128 

Statistical Physics, 533 
black body radiation, 536 
boson, 533 

chemical potential, 539 
distributions, 

Bose-Einstein, 533, 536, 545 
classical, 537 
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degenerate, 538 
Fermi-Dirac, 533, 536, 544 
Maxwell-Boltzmann, 533, 536, 542 
entropy, 535 

equipartition of energy, 537 
Fermi energy level, 540 
fermion, 533 

Helmholtz free energy, 535, 539 
macrostate, 535 
microstate, 535 
phase space, 533, 534 
Planck’s radiation law, 536 
statistical weight, 535 
Structure of spectral lines, 340 
Superposition 

many simple harmonic motions, 20 
two perpendicular simple harmonic 
motions, 15 

two simple harmonic motions (one 
dimension), 12 

Telescope, problem 329 
Telescope resolution of double star, 385 
Thick lens, 320, 322 
Thin lens, 318 
Total internal reflection, 256 
Transient effect in a forced oscillator, 58, 74 
Transmission line, 171 
as a Filter, 179 
lossless, 173 
real, 183 

Umklapp process, 452 
Uncertainty Principle, 414 

Vector operator i, 53 
Velocity 

group, 109, 130 
particle, 109 
wave, 109, 114 
Vibration insulator, 64 
Visibility of interference fringes, 360 

Wave, 108 

current, 172, 174 
dispersion of, 131 
electromagnetic, 199 



electron in solids, 441 
energy density in, 126, 157 
equation, 97, 110 
evanescent, 256 
function, 418 
group 

many components, 132 
two components, 128 
guide, 242, 353 

in a periodic structure, 135, 162 
intensity, 120, 157, 208 
length, 113 
longitudinal, 151, 159 
mechanics, 411 
non-linear, 505, 513, 514 
plane, 109 
progressive, 108 

reflection at a boundary, 117, 163, 177, 217, 
254, 546 

standing, 124, 348 
three-dimensional, 247 
transmission at a boundary, 117, 163, 217, 
254, 546 
transverse, 108 
two-dimensional, 240 
velocity, 109, 113 
voltage, 174, 175 
Wave mechanics, 
atomic clock, 431 
electron waves in solids, 441 
Fermi energy level, 426 
harmonic oscillator, 438 
one-dimensional potential well, 419 
penetration of potential barrier, 430 
potential step, 426 
reflection and transmission of 
quantum particles, 420, 427 
Schrodinger’s wave equation, 417 
square potential well, 434 
three-dimensional box, 424 
wave function, 418 
zero point energy, 422, 438 

Young’s modulus, 159 

Zero point energy, 422, 438 
Zone plate, 403 




